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* Figure 1A < 

1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGC. AAAGCTCCTC 60 

I - - . 

61 TGCCCACCCT GGCTCC(JaAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC l\_- TrrArrrA no 



121 GCAGCCAAAC GCCTCCTTOT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGojva 180 
181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300 
1 MA RAH W GCC PWL 12 



301 GTCCTCCTCT GTGCTTGTGC C TGGGGCC AC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360 
13VLLC ACA WGH TKPL DLG GQD 32 



361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420 
33VRNC STN PPY LPVT VVN TTM 52 

421 TCACTCACAG CCCTCCGCCA GCAGATGCAG AC C C AGAATC TCTCAGCCTA CATCATCCCA 480 
53SLTA LRQ QMQ TQNL SAY IIP 72 

481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG ACGAGAGGCG TGCGTGGATT 540 
73 G T D A HMN EYI GQHD ERR AWI 92 



541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGAC T A TGAAGAAAGC AGCTGTCTGG 600 
^ T G F T GSA GTA V VTM KKA AVW 112 

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGC TCC AT 660 
113 T D S R Y W T QAE RQMD CN W ELH 132 

661 AAGGAAGTTG GCACCACTCC T ATTGTC AC C TGGCTCCTCA CCGAGATTCC CGC TGGAGGG 720 
133 KEVG TTP IVT WLLT EIP AGG 152 

721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780 
153 RVGF DPF LLS IDT W ESY DLA 172 

7 81 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 
173 L Q G S NRQ LVS ITTN LVD LVW 192 

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900 
193GSER PPV PNQ PIYA LQE AFT 212 



901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 
213GSTW QEK VSG VRSQ MQK HQK 232 
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Figure IB 



961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT G AGG AG AC GG CCTGGCTCTT CAACCTTCGA 102 0 
233 VPTA VLL SAL EETA WLF NLR 252 



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
253 ASDI PYN PFF YSYT LLT DSS 272 



10 81 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 IRLF ANK SRF SSET LSY LNS 292 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 12 00 

293 SCTG PMC V Q I EDYS QVR DSI 312 

12 01 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 12 60 

313 QAYS L G D VRI WIGT SYT MYG 332 

12 61 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 132 0 

333 IYEM IPR EKL VTDT YSP VMM 352 

1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380 

353 TKAV KNS KEQ ALLK ASH VRD 372 



13 81 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440 

373 AVAV IRY LVW LEKN VPK GTV 392 

1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500 

393 DEFS GAE IVD KFRG EEQ FSS 412 

1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560 

413 GPSF ETI SAS GLNA ALA HYS 432 



15 61 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 162 0 
433 PTKE LNR KLS SDEM YLL DSG 452 



1621 GGGCAGTACT GGGAC GGGAC CACAGACATC ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 

453 GQYW DGT TDI TRTV HWG TPS 472 

16 81 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA AT AT TGAC C T GTCCAGGCTC 1740 

473 AFQK EAY TRV LIGN IDL SRL 492 

1741 ATCTTTCCCG C T GC T AC AT C AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 

493 IFPA ATS GRM VEAF ARR ALW 512 



1801 GATGC TGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 18 60 
513 DAGL NYG HGT GHGI GNF LCV 532 
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Figure 1C 

1861 CATGAGTGGC C AGTGGG AT T CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 192 0 

533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 19 80 

553 SIEP GYY KDG EFGI RLE D V A 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2 040 

573 LVVE AKT KYP GELP DLV VSF 592 

2 041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCCGAGCA TCTCCAGTAC 2100 

593 VPYD R N L IDV SLLS PEH L Q Y 612 

2101 CTGAATCGCT AC T AC C AGAC CAT C C GGG AG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 

613 LNRY YQT IRE KVGP ELQ RRQ 632 

jk-i 2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2 220 

Q 633 LLEE FEW LQQ HTEP L A A RAP 652 

9 

2 221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 22 80 

4/1 653 DTAS WAS VLV VSTL AIL GWS 672 



22 81 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTC CATC TAG ATGGGGGGCT CCCTTGCTTA 2340 
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 

2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC C C AGGAAAC A 2460 

2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2 520 

2 521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGC C AGC TAG TCTCTTCTCT TCTGTGATCT 2 580 

2 581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2 640 

2 641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2 7 00 

2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2 7 60 

27 61 CCCCCACTTT TTAACCATCG TCCATTAACC AC C T GGTGGG CAT AGC C AGA GCTGTTCGAA 2 82 0 

2 821 CCCAGCCAGG GAT G AAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2 880 



2 881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2 940 
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Figure ID 

2 941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGT TGGAG AACAAACTTG 3 000 
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060 

3 0 61 CAAAGTGCAG CCACAGCTAC AATGC TGTT A AATCCTCCCA CATTTCTTGG ATGCCCCTTC 312 0 
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGT GGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 33 00 
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 33 60 
3 3 61 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGAC AC TT 342 0 
3421 GCCTGTGG 3428 
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Figure 2A 

1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60 
61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA AC AAAAG AC C AGAAAAGCAC TCTCCACCCA 12 0 
121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180 
181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



2 41 CCAGCGCCGG CAT C T GGAG A CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300 
1 MA RAHW GCC PWL 12 



3 01 GTCCTCCTCT GTGCTTGTGC C TGGGGC C AC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 3 60 

13VLLC ACA WGH TKPL DLG GQD 32 

3 61 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA T AC C AC AATG 42 0 

33VRNC STN PPY LPVT V V N TTM 52 

421 TC AC TC AC AG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480 

53SLTA LRQ QMQ TQNL SAY IIP 72 

481 GGCACAGATG CTCACATGAA C GAGT AC AT C GGCCAACATG AC GAGAGGC G TGCGTGGATT 540 

73GTDA HMN EYI GQHD ERR A W I 92 

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGAC T A TGAAGAAAGC AGCTGTCTGG 600 

93TGFT GSA GTA VVTM KKA AVW 112 

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660 

113 TDSR YWT Q A E R Q M D CNW ELH 132 

661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC CGCTGGAGGG 72 0 

133 KEVG TTP IVT WLLT EIP AGG 152 

721 CGTGT GGGT T TTGACCCCTT CCTCTTGTCC ATTGAC AC CT GGGAGAGTTA TGATCTGGCC 780 

153 RVGF DPF LLS IDTW ESY DLA 172 

7 81 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 

173 LQGS NRQ LVS ITTN LVD LVW 192 

841 GGATC AGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900 

193 GSER PPV PNQ PIYA LQE AFT 212 



901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 
213 GSTW QEK VSG VRSQ MQK HQK 232 
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Figure 2B 



961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT G AGG AG AC GG CCTGGCTCTT CAACCTTCGA 102 0 
233 VPTA V Li L SAL EETA WLF NLR 252 



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
253 ASDI PYN PFF YSYT LLT DSS 272 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 I R L F ANK SRF SSET LSY LNS 292 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 1200 

293 SCTG PMC VQI EDYS QVR DSI 312 

12 01 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 12 60 

313 QAYS L G D VRI WIGT SYT MYG 332 

M 12 61 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 132 0 

Q 333 IYEM IPR EKL VTDT YSP VMM 352 



1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGC GGG AC 13 80 
353 TKAV KNS KEQ ALLK ASH VRD 372 



ii ?f 
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•vi- : 

m ...... 

fft 1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440 

V * 373 AVAV IRY LVW LEKN VPK GTV 392 

8 

t: . 
£*i 

fU 1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500 

i'U 393 DEFS GAE IVD KFRG EEQ FSS 412 

W ...... 

Q 1501 GGACCCAGTT TTGAAAC CAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560 

lufc 413 GPSF ETI SAS GLNA ALA HYS 432 

1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 162 0 
433 PTKE LNR KLS SDEM YLL DSG 452 



1621 GGGCAGTACT GGG AC GGGAC C AC AG AC AT C ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 
453 GQYW DGT TDI TRTV HWG TPS 472 



1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA AT AT TGAC C T GTCCAGGCTC 1740 
473 AFQK EAY TRV LIGN IDL SRL 492 



1741 ATCTTTCCCG C T GC T AC AT C AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 
493 IFPA ATS GRM VEAF ARR ALW 512 



1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 
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18 61 CATGAGTGGC CAGTGGGATT 
533 H E W P V G F 

1921 TCCATTGAAC CTGGTTACTA 
553 S I E P G Y Y 

1981 C TCGTGGT AG AAGCAAAGAC 
573 L V V E A K T 

2 041 GTGCCCTATG ACCGGAACCT 
593 V P Y D R N L 

2101 CTGAATCGCT ACTACCAGAC 
613 L N R Y Y Q T 

2161 CTACTAGAGG AGTTCGAGTG 
633 L L E E FEW 

2221 GACACCGCCT CCTGGGCCTC 
653 D T A S WAS 

22 81 GTCTAGAGGC TCCAGACTCT 
673 V * 

2 341 GCTCCCCTCA CCCTGCACTG 

2401 CCTTTGCATT CATCCTCCTT 

2461 CAGGGCTTCT TGGCCCCAGA 

2521 GGCCCCTAAT CCCAGGCCCC 

2 5 81 CAGTAGGCCT AACCTATAAC 

2 641 AAGCCCCAAG AAAACAATGC 

2701 CCTGTCACCG CGTGTTGGGC 

2761 CCCCCACTTT TTAACCATCG 

2 821 CCCAGCCAGG GAT G AAAAAT 

2 8 81 AGGTTCCATG CCAAGTAACA 



Figure 2C 

CCAGTCCAAC AACATCGCTA 
Q S N N I A M 

TAAGGATGGA GAATTTGGGA 
K D G E F G I 

CAAGTACCCA GGGGAGC T AC 
K Y P G E L P 

CATCGATGTC AGCCTGCTGT 
I D V S L L S 

CATCCGGGAG AAGGTGGGTC 
IRE K V G P 

GCTTCAACAG CACACAGAGC 
L Q Q H T E P 

TGTGTTAGTG GTCTCCACCC 
V L V V S T L 

CCTGTTAACC CTC CATC TAG 

AACATACCCC AAGAGCCCCT 
CTCCAAGACC TATGGAGAAG 
TGGCACCTCC CTGCACCCCG 
GAAATAGGAA AGC C AGC TAG 
CTAACACAGA CTGCTACAGC 
CCCTACCACC CAAGGGTGCC 
GTAACCAGAA CTGTTCCCCC. 
TCCATTAACC ACCTGGTGGG 
CAACCCCCGA CATGGAACCC 
GCAGAGGGAG TTAAGCCATA 



TGGCCAAGGG CATGTTCACT 192 0 
A K G M F T 552 

TCCGTCTCGA AGATGTGGCT 1980 
RLE D V A 572 

CTGACCTTGT GGTATCATTT 2 040 
D L V V S F 592 

CTCCGGAGCA TCTCCAGTAC 2100 
P E H L Q Y 612 

CAGAGCTGCA GAGGC GC C AG 2160 
E L Q R R Q 632 

CCCTGGCCGC CAGGGCCCCA 2 22 0 
L A A RAP 652 

TTGCCATCCT TGGC TGGAGT 22 80 
AIL G W S 672 

ATGGGGGGCT CCCTTGCTTA 2340 

673 

GCTGGCCCAT TGCCTAGAAA 2400 
GTCCCAGGCC CCAGGAAACA 2460 
GGGTTGTATA CCACACCCTG 2 52 0 
TCTCTTCTCT TCTGTGATCT 2 580 
TGCTCCCCTC CCGCCAAACA 2 640 
ATGGTCCCGG GAAAACCCAA 2700 
CCACCAGGGC TTAAAAATCG 2760 
CAT AGC C AG A GCTGTTCGAA 2 820 
ATGATTCCTA AACCCGGGGT 2 880 
GGAATTTGGC TGTGGAGTAA 2 940 
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Figure 2D 

2 941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3 000 

3 001 GGCAGCCGGA ACCCGTCACT AT TT TAG AT T CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060 
3 061 CAAAGTGCAG CCACAGCTAC AATGC TGTT A AATCCTCCCA CATTTCTTGG ATGCCCCTTC 312 0 
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3 300 
3 3 01 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3 3 60 
3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 342 0 
3421 GCCTGTGG 3428 
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Figure 3A 

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 12 0 
19FPQN ATA CDN APEA WDL LHR 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 18 0 
39VLPT FII SIC FFGL LGN LFV 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 3 00 
79LAAS DLV FVL GLPF WAE N I W 98 

3 01 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 3 60 
99NQFN WPF GAL LCRV ING V I K 118 

3 61 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 42 0 
119 ANLF ISI FLV VAIS QDR YRV 138 

421 CTGGTGCACC C T ATGGC C AG C GGAAGGC AG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139LVHP MAS GRQ QRRR QAR VTC 158 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159 VLIW VVG GLL SIPT FLL RSI 178 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 
179 QAVP DLN I T A CILL LPH EAW 198 

601 CACTTTGCAA GG AT TGTGG A GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199 HFAR IVE LNI LGFL LPL A A I 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 72 0 
219 VFFN YHI LAS LRTR EEV SRT 238 

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 7 80 
239 RVRG PKD SKT TALI LTL VVA 258 

7 81 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 

841 CAAGCAGTCC GAGGCTGCTT' TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 9 00 
279 QAVR GCF WED FIDL GLQ LAN 298 
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Figure 3B 

901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 

299 FFAF TNS SLN PVIY VFV GRL 318 

9 61 TTCAGGACCA AGGTC TGGG A ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 102 0 

319 FRTK VWE LYK QCTP KSL API 338 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 

339 SSSH RKE IFQ LFWR N* 353 



1081 CC 1082 
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1 CTGTGCATGG CATCATCCTG 
1 M A S S W 



61 TTCCCTCAAA ATGCTACGGC 
19FPQN ATA 



121 GTGCTGCCGA CATTTATCAT 
39 V L P T FII 



181 CTGTTGGTCT TCCTCCTGCC 
59 L L V F LLP 



2 41 CTGGCAGCCT CTGATCTGGT 
79LAAS DLV 



301 AACCAGTTTA ACTGGCCTTT 
99NQFN WPF 



361 GCCAATTTGT TCATCAGCAT 
119 A N L F I S I 



421 C TGGTGC AC C CTATGGCCAG 
139 L V H P MAS 



481 GTGCTCATCT GGGTTGTGGG 
159 V L I W V V G 



541 CAAGCCGTCC CAGATCTGAA 
179 Q A V P D L N 



601 CACTTTGCAA GGATTGTGGA 
199 H F A R I V E 



661 GTCTTCTTCA ACTACCACAT 
219 V F F N Y H I 



721 AGAGTGC GGG GGCCGAAGGA 
239 R V R G P K D 



781 TTCCTGGTCT GCTGGGCCCC 
259 F L V C WAP 



841 CAAGCAGTCC GAGGCTGCTT 
279 Q A V R G C F 



Figure 4A 

GCCCCCTCTA GAGCTCCAAT 
PPL E L Q S 

CTGTGACAAT GCTC C AGAAG 
C D N A P E A 



CTCCATCTGT TTCTTCGGCC 
SIC F F G L 



CCGGCGGCAA CTGAACGTGG 
R R Q L N V A 



GTTTGTCTTG GGCTTGCCCT 
F V L G L P F 



CGGAGCCCTC CTCTGCCGTG 
GAL L C R V 



CTTCCTGGTG GTGGCCATCA 
F L V V A I S 



CGGAAGGCAG CAGCGGCGGA 
G R Q Q R R R 



GGGCCTCTTG AGCATCCCCA 
G L L S I P T 



CATCACCGCC TGCATCCTGC 
I T A C I L L 



GTTAAATATT CTGGGTTTCC 
L N I L G F L 



CCTGGCCTCC CTGCGAACGC 
LAS L R T R 



TAGCAAGACC ACAGCGCTGA 
S K T TALI 



TTACCACTTC TTTGCCTTCC 
Y H F F A F L 



TTGGGAGGAC TTCATTGACC 
WED F I D L 



CCTCCAACCA GAGCCAGCTC 60 

S N Q SQL 18 

CCTGGGACCT GCTGCACAGA 12 0 

W D L L H R 38 



TCCTAGGGAA CCTTTTTGTC 180 
L G N L F V 58 



CAGAAATCTA CCTGGCCAAC 240 
E I Y LAN 78 



TCTGGGCAGA GAATATCTGG 300 
W A E N I W 98 



TCATCAACGG GGTCATCAAG 3 60 
I N G V I K 118 



GCCAGGACCG CTACCGCGTG 42 0 
Q D R Y R V 138 



GGCAGGCCCG GGTCACCTGC 480 
Q A R V T C 158 



CATTCCTGCT GCGATCCATC 540 

F L L RSI 178 

TCCTCCCCCA TGAGGCCTGG 600 

L P H E A W 198 



TCCTACCACT GGCTGCGATC 660 
L P L A A I 218 



GGGAGGAGGT CAGCAGGACA 72 0 
E E V S R T 238 



TCCTCACGCT CGTGGTTGCC 7 80 
L T L V V A 258 



TGGAATTCTT ATTCCAGGTG 840 
E F L F Q V 278 



TGGGCCTGCA ATTGGCCAAC 900 
G L Q LAN 298 



D0053 NP 



Figure 4B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCAGCTC 960 

299 FFAF TNS SLN PVIY VFV GQL 318 

961 TTCAGGACCA AGGTC TGGG A AC TT TAT AAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 

319FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE I F Q LFWR N * 353 



1081 CC 1082 
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Figure 5A 

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 12 0 
19FPQN ATA CDN APEA WDL LHR 38 

121 GTGCTGCCAA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FII SIC FFGL LGN LFV 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 3 00 
79LAAS DLV FVL GLPF WAE NIW 98 

3 01 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 3 60 
H 99NQFN WPF GAL LCRV ING V I K 118 



q 
ci 
m 



3 61 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
119 ANLF ISI FLV VAIS QDR YRV 138 



jjjj 421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 

139 LVHP MAS GRQ QRRR QAR VTC 158 

is 

N 481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GC GAT C C AT C 540 

|f[j 159 VLIW VVG GLL SIPT FLL RSI 178 

M 

Ul 541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 

p 179 Q A V P DLN ITA CILL LPH EAW 198 



601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199 HFAR IVE LNI LGFL LPL A A I 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 72 0 
219 VFFN YHI LAS LRTR EEV SRT 238 

721 AGAGTGC GGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 7 80 
239 RVRG PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 



841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 9 00 
279 QAVR GCF WED FIDL GLQ LAN 298 
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Figure 5B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA AC T TT AT AAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 102 0 
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE I F Q LFWR N * 353 



1081 CC 1082 



D0053 NP 



Figure 6A 

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60 
1MFSP WKI SMF LSVR EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCC C AC TCTTAACGGG 12 0 
21TASF SAD MLN V T L Q GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
41TFAQ SKC PQV EWLG W L N TIQ 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AG AAC AT C T T TGTCCTCAGC 240 
61PPFL WVL F V L ATLE NIF VLS 80 

241 GTCTTCTGCC TGC AC AAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 3 00 
81VFCL HKS SCT VAEI YLG NLA 100 

3 01 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 3 60 
101 AADL I L A C G L PFWA ITI SNN 120 

3 61 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 42 0 
121 FDWL FGE TLC RVVN All SMN 140 

421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141 LYSS ICF L M L VSID RYL ALV 160 

481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
161KTMS MGR MRG VRWA KLY SLV 180 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181 IWGC TLL LSS PMLV FRT MKE 200 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201 YSDE GHN VTA CVIS YPS LIW 220 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 72 0 
221 EVFT NML LNV VGFL LPL SVI 240 

721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA AC GAGATGC A GAAGTTCAAG 780 
241 TFCT MQI MQV LRNN EMQ KFK 260 

7 81 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
261 EIQT ERR A T V LVLV VLL LFI 280 



841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ 1ST FLDT LHR LGI 300 
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Figure 6B 

901 CTCTCCAGCT GCCAGGACGA GCGC AT CATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 
301 LSSC QDE RII DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 102 0 
321 AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKG GCR SEP 360 

10 81 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA AC GC C AG ATT 1140 
361 IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGAC TGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q* 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 12 60 

1261 AC AT C TAT GC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 132 0 

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG AT GGGGT G AA CTCACGCACA 13 80 

1381 GCCAAGGACT C C AAAAT C AC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 162 0 

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 

1801 TGAGC AC TGT AGGC AAGAC C CAAGAAAGAG AAGGAGC CAT CTCCATCTTG AAGGAAC T C A 1860 

1861 AAGACTCAAG TGGGAAC GAC TGGGC ACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG C AG AAGGGGG AG AC C AAGGT TCCAGCTCAA 19 80 

19 81 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2 040 
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Figv 

2 041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA 
2161 AC TGAGGTC T AGAAATAGCT CCGTGGAGCA 
2 221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC 
2281 CCACCCTGAG GCCCCAACCG CCACACACAC 
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG 
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG 
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG 
2 521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG 
2 581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT 
2 641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG 
27 61 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG 
2 821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG 
2 881 AAGGGCTAGA AC C TGGAGGG CTGGAATCTG 

2 941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT 

3 001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC 
3 061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC 
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC 
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT 



re 6C 

TAAAGTACTT AGAAAAGCAA AGGGTGC T AC 2100 
AC TGGG AT AT GTTTACTATA AGGAAAAGAC 2160 
GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220 
TCATTGGCTC CCTTCCACCT GTCATTCCCA 22 80 
AGGAGC AT T T GGAGAGAAGG CCATGTCTTC 2340 
ATATTTCTAA TCGGTCTTGC C C AG AGG AT C 2 400 
T AC C TGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
GGATTGTTCC TGTCAATCAA TGGTTTATTG 2 52 0 
TGTGAAAAGG AATGGCAATG GTGTTCACCA 2 580 
TGATAAATGA ATATTTATTA GC TGGTTGG A 2 640 
GAACTAGAAC CTGGAGGGCT AGAACCTGGA 27 00 
GAGGGGCTAG AACCTAGAGA AGCTAAAACC 27 60 
AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2 82 0 
AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2 880 
GAGAGCTAGA AC C TGGAGGG CTAGAACCTG 2 940 
GGAGGGC T AG AACCTGGCAG GTTAGAACCT 3 000 
TGGAGGGC T A GAACCTGGAA GGGCTAGAAC 3 060 
CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 312 0 
CTGGAGAATG AG AAAAAT T T ACATGGCAAA 3180 
GAATTTTAAA GC AAAAGCGT GAAAAAAAAG 3240 



D0053 NP 

Figure 6D 

32 41 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3 3 00 

33 01 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 33 60 
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 342 0 
3 421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTC TGAGAC TCTCTTAGGA 3480 
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3 540 
3 541 CTGCGGGAGA AG AGCGGC C C T ATGC ATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 
3 661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 372 0 
3721 AAAAAAAAAA AAA 3733 
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Figure 7A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA 



121 AGAAGG AC C C TGAGCCCCAG GCGCCAGCCA 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA 
1 



2 41 CTCTCCCCAA ACATCTCCAC TAACACCTCG 
11LSPN 1ST NTS 



3 01 CAAATTGTCC TTTGGGCAGC TGCCTACACG 
31QIVL W A A AYT 



3 61 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC 
51 V V V M W I I LAH 



421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC 
71 L V N L AFA EAS 



481 ACCTATGCTG TCCACAACGA ATGGT AC TAC 
91TYAV HNE WYY 



541 TTTCCCATCG CCGCTGTCTT CGCCAGTATC 
111 FPIA A V F ASI 



TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 12 0 



CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 



ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
MDNV LPV DSD 10 



GAACCCAATC AGTTCGTGCA ACCAGCCTGG 3 00 
EPNQ FVQ PAW 30 



GTCATTGTGG TGACCTCTGT GGTGGGCAAC 3 60 
VIVV TSV VGN-50 



AAAAGAATGA GGACAGTGAC GAACTATTTT 42 0 
KRMR TVT NYF 70 



ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
MAAF NTV VNF 90 



GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 
GLFY CKF HNF 110 



TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 
YSMT AVA FDR 130 



601 TACATGGCCA TCATACATCC 
131 Y M A I I H P 



661 ATCTGTGTCA TCTGGGTCCT 
151 I C V I W V L 



721 ACAGAGACCA TGCCCAGCAG 
171 T E T M P S R 



781 ATTTATGAGA AAGTGT AC C A 
191 I Y E K V Y H 



841 GTGATTGGCT ATGCATACAC 
211 V I G Y AYT 



901 GACTCCTCTG ACCGCTACCA 
231 D S S D R Y H 



CCTCCAGCCC CGGCTGTCAG 
L Q P R L S A 



GGCTCTCCTG CTGGCCTTCC 
ALL L A F P 



AGTCGTGTGC ATGATCGAAT 
V V C M I E W 



CATCTGTGTG ACTGTGCTGA 
I C V T V L I 



CGTAGTGGGA AT C AC AC TAT 
V V G I T L W 



CGAGCAAGTC TCTGCCAAGC 
E Q V S A K R 



CCACAGCCAC CAAAGTGGTC 660 
TAT K V V 150 



CCCAGGGCTA CTACTCAACC 72 0 
Q G Y Y S T 170 



GGCCAGAGCA TCCGAACAAG 7 80 
P E H P N K 190 



TCTACTTCCT CCCCCTGCTG 840 
Y F L P L L 210 



GGGCCAGTGA GATCCCCGGG 900 
A S E IPG 230 



GCAAGGTGGT CAAAATGATG 960 
K V V K M M 250 
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Figure 7B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 

251 IVVV CTF AIC WLPF HIF FLL 270 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 

271 PYIN PDL YLK KFIQ QVY LAI 290 

1081 ATGTGGCTGG CCATGAGCTC C AC C ATGT AC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 

291 MWLA MSS TMY NPII YCC LND 310 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 

311 RFRL GFK HAF RCCP FIS AGD 330 



12 01 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 12 60 

331 YEGL EMK STR YLQT QGS VYK 350 

1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320 

351 VSRL ETT 1ST VVGA HEE EPE 370 



13 21 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 13 80 
371 DGPK ATP SSL DLTS NCS SRS 390 

13 81 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS * 407 

1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 

1501 TCCCTTCATC TGGAACCATC AGAAAC AC C C TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 

1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 

1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 

1681 AGGTCGGACC AGCTTTTCCT C AAGAGC CCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 17 66 
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Figure 8A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 
121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 

1 MDNVLPVDSD10 

241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 3 00 

11LSPN 1ST NTS EPNQ F V Q PAW 30 

3 01 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360 

31QIVL W A A AYT VIVV TSV VGN 50 

3 61 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420 

51 V V V M W I I LAH KRMR TVT NYF 70 



421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 

71LVNL AFA EAS MAAF NTV VNF 90 

481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 

91TYAV HNE WYY GLFY CKF HNF 110 

541 TTCCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 

111 FPIA A V F ASI YSMT AVA FDR 130 



601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 

131 YMAI IHP LQP RLSA TAT KVV 150 

661 ATCTGTGTCA TCTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 72 0 

151 ICVI WVL ALL LAFP QGY YST 170 



721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780 
171 TETM PSR VVC MIEW PEH PNK 190 



7 81 ATTTATGAGA AAGTGT AC C A CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 

191 IYEK VYH ICV TVLI YFL PLL 210 

841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATC AC AC TAT GGGCCAGTGA GATCCCCGGG 900 

211 VIGY AYT VVG ITLW ASE IPG 230 

901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GC AAGGTGGT CAAAATGATG 960 

231 DSSD RYH EQV SAKR KVV KMM 250 
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961 ATTGTCGTGG 
251 I V V V 



1021 CCCTACATCA 
271 P Y I N 



1081 ATGTGGCTGG 
291 M W L A 



1141 AGGTTCCGTC 
311 R F R L 



12 01 TATGAGGGGC 
331 Y E G L 



1261 GTCAGCCGCC 

^ 351 V S R L 

O 

jp 1321 GACGGCCCCA 

;! ? ;| 371 D G P K 

ii 

iyf 13 81 G AC TC C AAGA 

p\ 391 D S K T 

H 1441 GGGCCTTTGG 



T'. 1501 TCCCTTCATC 

CI 

jU 15 61 TGGGTTAGGG 



1621 CACCCTCATG 
1681 AGGTCGGACC 




Figure 8B 

TGTGCACCTT CGCCATCTGC TGGCTGCCCT 
CTF AIC WLPF 



ACCCAGATCT CTACCTGAAG AAGTTTATCC 
PDL Y L K KFIQ 



CCATGAGCTC CACCATGTAC AACCCCATCA 
MSS TMY NPII 



TGGGCTTCAA GCATGCCTTC CGGTGCTGCC 
GFK HAF RCCP 



TGGAAATGAA ATCCACCCGG TATCTCCAGA 
EMK STR Y L Q T 



TGGAGAC C AC CATCTCCACA GTGGTGGGGG 
ETT 1ST VVGA 



AGGCCACACC CTCGTCCCTG GACCTGACCT 
ATP SSL DLTS 



CCATGACAGA GAGCTTCAGC TTCTCCTCCA 
MTE SFS FSSN 



CAGGTGCAGC CCCCACTGCC TTTGACCTGC 



TGGAACCATC AGAAAC AC C C TCACACTGGG 



AAAACATTCC ATCCTTGAGT CAAAAAATCT 



C T GTGTGAC T CAAACCAAAT CACTGAACTT 



AGCTTTTCCT C AAGAGC CCA ATGCATTCCA 




TCCACATCTT CTTCCTCCTG 102 0 
H I F F L L 270 



AGCAGGTCTA CCTGGCCATC 1080 
Q V Y LAI 290 



TCTACTGCTG CCTCAATGAC 1140 
Y C C L N D 310 



CCTTCATCAG CGCCGGCGAC 1200 
F I S A G D 330 



CCCAGGGCAG TGTGTACAAA 12 60 

Q G S V Y K 350 

CCCACGAGGA GGAGCCAGAG 132 0 

H E E E P E 370 



CCAACTGCTC TTCACGAAGT 13 80 
N C S S R S 390 



ATGTGCTCTC CTAGGCCACA 1440 
V L S * 407 



CTCCCTTCAT GCATGGAAAT 1500 



ACTTGCAAAA AGGGTCAGTA 15 60 



CAATTCTTCC CTATCTTTGC 162 0 



TGCTGAGCCT GTAAAATAAA 168 0 



TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 17 66 
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Figure 9A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 
61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 12 0 



121 AG AAGG AC C C TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGT TGTGT A 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 

MDNV LPV DSD 10 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300 
11LSPN 1ST NTS EPN Q FVQ PAW 30 



301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 3 60 

31QIVL W A A AYT VIVV TSV VGN 50 

3 61 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 42 0 

51 V V V M W I I LAH KRMR TVT NYF 70 



421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
71LVNL AFA EAS MAAF NTV VNF 90 



481 ACCTATGCTG TCCACAACGA ATGGT AC T AC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 

91TYAV HNE WYY GLFY CKF HNF 110 

541 TTTCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 

111 F P I A A V F ASI YSMT AVA FDR 130 



601 T AC ATGGC C A TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 66 0 
131 YMAI IHP LQP RLSA TAT KVV 150 



661 ATCTGTGTCA TATGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 72 0 
151 I C V I WVL ALL LAFP QGY YST 170 



721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 7 80 

171 TETM PSR VVC MIEW PEH PNK 190 

7 81 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 

191 IYEK VYH ICV TVLI YFL PLL 210 
• 

841 GTGATTGGCT ATGC AT AC AC CGTAGTGGGA ATC AC AC TAT GGGCCAGTGA GATCCCCGGG 900 

211 VIGY AYT VVG ITLW ASE IPG 230 

9 01 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960 

231 DSSD RYH EQV SAKR KVV KMM 250 
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Figure 9B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 

251 IVVV CTF AIC W L P F HIF F L L 270 

1021 CCCTACATCA AC C C AG AT C T CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 

271 PYIN PDL YLK KFIQ Q V Y LAI 290 

1081 ATGTGGC TGG CCATGAGCTC C AC C AT GT AC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 

291 MWLA MSS TMY NPII YCC LND 310 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 12 00 

311 RFRL GFK HAF RCCP FIS AGD 330 

12 01 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 12 60 

331 YEGL EMK STR YLQT QGS VYK 350 

12 61 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 132 0 

H»* 351 VSRL ETT 1ST VVGA HEE EPE 370 

a 

f*5 ...... 

; ;;;;f 1321 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 13 80 

Q 371 DGPK ATP SSL DLTS NCS SRS 390 

Efi 

•13" t 

■ E e £ ...... 



1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS * 407 

1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 



W. 

m 

G 1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 



1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 
1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 
1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 
1741 TGCATGCGAG TGCTCATTTC AGGATG 1766 
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1 AATTCAGAGC CACCGCGGGC 



61 CAGTTCAGCT TTCAAAAAGA 



121 AG AAGG AC C C TGAGCCCCAG 



181 CAGATAGTAG GCTTTACGCC 
1 



241 CTCTCCCCAA ACATCTCCAC 
11LSPN 1ST 



3 01 CAAATTGTCC TTTGGGCAGC 
31QIVL W A A 

3 61 GTGGTAGTGA TGTGGATCAT 
51 V V V M W I I 



421 CTGGTGAACC TGGCCTTCGC 
71LVNL AFA 



481 ACCTATGCTG TCCACAACGA 
91TYAV HNE 



541 TTTCCCATCG CCGCTGTCTT 
111 F P I A A V F 



601 T AC ATGGC C A T CAT AC AT C C 
131 Y M A I I H P 



6 61 ATCTGTGTCA TCTGGGTCCT 
151 I C V I W V L 



721 ACAGAGACCA TGCCCAGCAG 
171 T E T M P S R 



7 81 ATTTATGAGA AAGTGTACCA 
191 I Y E K V Y H 



841 GTGATTGGCT AT GC AT AC AC 
211 V I G Y A Y T 



901 GACTCCTCTG ACCGCTACCA 
231 D S S D R Y H 



Figure 10A 

AGGCGGGCAG TGCATCCAGA 



GTGCTGCCCA T AAAAAGC C T 
GCGCCAGCCA CAGGACTCTG 



TAGCTTCGAA ATGGATAACG 
M D N V 



TAACACCTCG GAACCCAATC 
NTS E P N Q 



TGCCTACACG GTCATTGTGG 
A Y T V I V V 



CTTAGCCCAC AAAAGAATGA 
L A H K R M R 



GGAGGCCTCC ATGGC TGC AT 
E A S M A A F 



ATGGT AC T AC GGCCTGTTCT 
W Y Y G L F Y 



CGCCAGTATC TACTCCATGA 
A S I Y S M T 



CCTCCAGCCC CGGCTGTCAG 
L Q P R L S A 



GGCTCTCCTG CTGGCCTTCC 
ALL L A F P 



AGTCGTGTGC ATGATCGAAT 
V V C M I E W 



CATCTGTGTG AC TGTGC TGA 
I C V T V L I 



CGTAGTGGGA AT C AC AC TAT 
V V G I T L W 



CGAGCAAGTC TCTGCCAAGC 
E Q V S A K R 



AGCGTTTATA TTCTGAGCGC 60 



TCCACCCTCC TGTCTGCTTT 12 0 



C TGC AGAGGG GGGTTGTGTA 180 

TCCTCCCGGT GGACTCAGAC 240 
L P V DSD 10 



AGTTCGTGCA ACCAGCCTGG 3 00 

F V Q PAW 30 

TGACCTCTGT GGTGGGCAAC 3 60 

T S V V G N 50 



GGACAGTGAC GAACTATTTT 42 0 
T V T N Y F 70 



TCAATACAGT GGTGAACTTC 480 
N T V V N F 90 



AC TGC AAGT T CCACAACTTC 540 
C K F H N F 110 



CGGCTGTGGC CTTTGATAGG 600 
A V A FDR 130 



CCACAGCCAC CAAAGTGGTC 660 
TAT K V V 150 



CCCAGGGCTA CTACTCAACC 72 0 
Q G Y Y S T 170 



GGCCAGAGCA TCCGAACAAG 7 80 
P E H P N K 190 



TCTACTTCCT CCCCCTGCTG 840 
Y F L P L L 210 



GGGCCAGTGA GATCCCCGGG 900 
A S E IPG 230 



GCAAGGTGGT CAAAATGATG 960 
K V V K M M 250 
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Figure 10B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 

251 IVVV CTF AIC WLPF HIF FLL 270 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 

271 PYIN PDL YLK KFIQ QVY LAI 290 



1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 
291 MWLA MSS TMY NPII YCC LND 310 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 12 00 
311 RFRL GFK HAF RCCP F I S AGD 330 

1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260 
331 YEGL EMK STR YLQT QGS VYK 350 

12 61 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 13 2 0 
351 VSRL ETT 1ST VVGA HEE EPE 370 

13 21 GACGGCCCCA AGGCCACACC CTCATCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371DGPK ATP SSL DLTS NCS SRS 390 



13 81 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS * 407 



1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 
1501 TCCCTTCATC TGGAACCATC AGAAAC AC C C T C AC AC TGGG ACTTGCAAAA AGGGTCAGTA 1560 



1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 
1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 



1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGC ATGC GAG TGCTCATTTC AGGATG 17 66 



* 
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Figure 11A 

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 12 0 
1MASR LTL LTL L L L L LAG DRA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GC AAG AC AG A 180 
21SSNP NAT SSS SQDP E S L QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI SKML FVE P I L 60 

2 41 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 3 00 
61EVSS LPT T N S TTNS ATK ITA 80 

3 01 AATACCACTG AT GAAC C C AC CACACAACCC AC C AC AGAGC CCACCACCCA ACCCACCATC 3 60 
81NTTD EPT TQP TTEP TTQ PTI 100 

3 61 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC C AC T AC TGGG 42 0 
101 QPTQ PTT QLP TDSP TQP TTG 120 



q 

m ...... 

y!| 421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 

U\ 121 SFCP GPV TLC SDLE SHS TEA 140 

m 

;i; 481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 

jk 141 VLGD ALV DFS LKLY HAF SAM 160 

rii 
H if 

J"? 541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 

Ul 161 KKVE TNM AFS PFSI ASL LTQ 180 

Q 

U ...... 

601 GTCCTGCTCG GGGC TGGGC A GAAC AC C AAA AC AAACC TGG AGAGC ATCC T CTCTTACCCC 660 

181 VLLG A G Q NTK TNLE SIL SYP 200 



661 AAGGAC T TC A CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720 
201 KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GC C AT AAGGG ACACCTTTGT GAATGCCTCT 780 

221 VSQI FHS PDL AIRD TFV NAS 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA AC AGTGAC GC CAACTTGGAG 840 

241 RTLY SSS PRV LSNN SDA NLE 260 

841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 

261 LINT WVA KNT NNKI SRL LDS 280 

901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT AC C TGAGT GC CAAGTGGAAG 96 0 

281 LPSD TRL VLL NAIY LSA KWK 300 
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Figure 11B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 102 0 
301TTFD PKK TRM EP FH FKN SVI 320 



1021 AAAGTGC CCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321KVPM MNS KKY PVAH FID QTL 340 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341 KAKV GQL QLS HNLS LVI LVP 360 



1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 12 00 

361QNLK HRL EDM E Q A L SPS V F K 380 

12 01 GC CATC ATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 12 60 

381AIME KLE MSK FQPT L L T L P R 400 



12 61 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 13 2 0 
401 IKVT TSQ D M L SIME KLE FFD 420 



1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG AC AGAGG AC C CAGATCTTCA GGTTTCTGCG 13 80 
421FSYD LNL CGL TEDP DLQ VSA 440 



13 81 ATGCAGCACC AG AC AGTGC T GGAACTGACA GAGACTGGGG TGGAGGC GGC TGCAGCCTCC 1440 

441MQHQ TVL ELT ETGV EAA AAS 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 

461AISV ART LLV FEVQ QPF LFV 480 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 156 
481LWDQ QHK FPV FMGR VYD PRA 500 



1561 TGAGACCTGC AGGATCAGGT T AGGGC GAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 



1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



16 81 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 182 6 
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Figure 12A 

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 



61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 12 0 
1MASR LTL LTL LLLL LAG DRA 20 



121 TCCTCAAATC C AAATGC T AC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP ESL QDR 40 



181 GGC GAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI SKML FVE P I L 60 



241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 3 00 
61EVSS LPT TNS TTNS ATK I T A 80 



9 

Q 



a 



3 01 AATACCACTG AT GAAC C C AC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 3 60 

81NTTD EPT TQP TTEP TTQ PTI 100 

3 61 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC C AC T AC TGGG 42 0 

101 QPTQ PTT QLP TDSP TQP TTG 120 



...... 

y!) 421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 

121SFCP GPV TLC SDLE SHS TEA 140 

Mi 

* 4 81 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 

H 141 VLGD ALV DFS LKLY HAF SAM 160 

111 

f* ...... 

f : l 541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 

W 161 KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGC TGGGC A GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181 VLLG A G Q NTK TNLE SIL SYP 200 



661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 72 0 
201 KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 
221 VSQI FHS PDL AIRD TFV NAS 240 



781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
241 RTLY SSS PRV LSNN SDA NLE 260 



841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 



9 01 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281 LPSD TRL VLL NAIY LSA KWK 300 
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Figure 12B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 102 0 
301 TTFD PKK TRM EPFH FKN SVI 320 

1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 10 80 
321 KVPM MNS KKY PVAH FID QTL 340 

1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341 KAKV GQL QLS HNLS LVI LVP 360 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 12 00 
361 QNLK HRL EDM EQAL SPS VFK 380 

12 01 GCCATCATGG AG AAAC TGG A GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 12 6 0 
381 AIME KLE MSK FQPT LLT LPR 400 

H 12 61 AT C AAAGTGA CGACCAGTCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 

ip 401 IKVT TSQ DML SIME KLE FFD 420 

Q 

; h ; r 1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG AC AG AGGAC C CAGATCTTCA GGTTTCTGCG 1380 

SJl 421FSYD LNL CGL TEDP DLQ VSA 440 

• •** 

j; ?? ...... 

13 81 ATGCAGCACC AGACAGTGCT GGAAC TGAC A GAGAC TGGGG TGGAGGC GGC TGCAGCCTCC 1440 
441 MQHQ TVL ELT ETGV E A A A A S 460 

f\i 1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 15 00 

!L5i 461 AISV ART LLV FEVQ QPF LFV 480 

y ...... 

Q 1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 15 60 

481 LWDQ QHK FPV FMGR VYD PRA 500 

1561 TG AG AC C T GC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 162 0 

1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 16 80 

1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



17 41 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG T TC AC C AG AC 18 00 



1801 TCTATAAATA AAAC C TG AC A G AC CAT 182 6 
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Figure 13A 

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR LTL LTL L L L L LAG DRA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC C AG AGAGTT T GC AAG AC AG A 180 
21SSNP NAT SSS SQDP ESL QDR 40 

181 GGC GAAGGG A AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGCTGA ACCCATCCTG 240 
41GEGK VAT TVI SKML F A E P I L 60 

241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300 
61EVSS LPT TNS TTNS ATK ITA 80 

3 01 AATACCACTG ATGAAC C C AC CACACAACCC AC C AC AGAGC CCACCACCCA ACCCACCATC 360 
81NTTD EPT TQP TTEP TTQ PTI 100 

b ...... 

p 3 61 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 42 0 

.F»* 101 QPTQ PTT QLP TDSP TQP TTG 120 

W 

>fa ! l . m . . . . . 

! £) 421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTC ATT C AACAGAGGCC 480 

{J! 121SFCP GPV TLC SDLE SHS TEA 140 

m 

: 481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 

H 141 VLGD ALV DFS LKLY HAF SAM 160 

til 

p ...... 

541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 6 00 

tel 161 KKVE TNM AFS PFSI ASL LTQ 180 

P 

!b:fc ...... 

601 GTCCTGCTCG GGGCTGGGCA GAAC AC C AAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181 VLLG A G Q NTK TNLE SIL SYP 200 

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 72 0 
201 KDFT CVH QAL KGFT TKG VTS 220 

721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GC C AT AAGGG ACACCTTTGT GAATGCCTCT 7 80 
221 VSQI FHS PDL AIRD TFV NAS 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
241 RTLY SSS PRV LSNN SDA NLE 260 

841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 



901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281 LPSD TRL VLL NAIY LSA KWK 300 
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• 



9 61 ACAACATTTG ATCCCAAGAA 
301 T T F D P K K 



1021 AAAGTGCCCA TGATGAATAG 
321 K V P M M N S 



1081 AAAGCCAAGG TGGGGCAGCT 
341 K A K V G Q L 



1141 CAGAACCTGA AACATCGTCT 
361 Q N L K H R L 



12 01 GCCATCATGG AGAAAC TGGA 
381 A I M E K L E 



1261 ATCAAAGTGA CGACCAGCCA 
401 I K V T T S Q 

1321 TTTTCTTATG ACCTTAACCT 
421 F S Y D L N L 

13 81 ATGCAGCACC AGACAGTGCT 
441 M Q H Q T V L 

1441 GCCATCTCTG TGGCCCGCAC 
461 A I S V ART 



1501 CTCTGGGACC AGCAGCACAA 
481 L W D Q Q H K 



1561 TGAGACCTGC AGGATCAGGT 



1621 AGCCCTGCTG CTGCCTGCCT 



1681 CCACCAAAAG GGCTCCTGAG 



1741 GGCCCTGCCA TGCTCTCCAA 



1801 TCTATAAATA AAACCTGACA 



Figure 13B 

AACCAGAATG GAACCCTTTC 
T R M E P F H 



CAAGAAGTAC CCTGTGGCCC 
K K Y P V A H 



GCAGCTCTCC CACAATCTGA 

Q L S H N L S 

TGAAGACATG GAACAGGCTC 

EDM E Q A L 



GATGTCCAAG TTCCAGCCCA 
M S K F Q P T 



GGATATGCTC TCAATCATGG 
D M L S I M E 

GTGTGGGCTG AC AGAGG AC C 
C G L T E D P 

GGAAC TGAC A GAGAC TGGGG 
E L T E T G V 

CCTGCTGGTC TTTGAAGTGC 
L L V F E V Q 

GTTCCCTGTC TTCATGGGGC 
F P V F M G R 

TAGGGCGAGC GCTACCTCTC 



GGACTTGCCC CTGCCACCTC 



GGTCTGGGCA AGGGACCTGC 
ACCACTTTTT GCAGCTTTCT 



GACCAT 182 6 



ACTTCAAAAA CTCAGTTATA 1020 
F K N S V I 320 



ATTTCATTGA CCAAACTTTG 10 80 

FID Q T L 340 

GTTTGGTGAT CCTGGTACCC 1140 

L V I L V P 360 



TCAGCCCTTC TGTTTTCAAG 12 00 

S P S V F K 380 

CTCTCCTAAC ACTACCCCGC 12 60 

L L T L P R 400 

AG AAAT TGGA ATTCTTCGAT 13 2 0 

K L E F F D 420 



CAGATCTTCA GGTTTCTGCG 1380 
D L Q V S A 440 

TGGAGGCGGC TGCAGCCTCC 1440 
E A A A A S 460 



AGCAGCCCTT CCTCTTCGTG 150 0 
Q P F L F V 480 



GAGTATATGA CCCCAGGGCC 1560 

V Y D P R A 500 

CAGCCTCAGC TCTCAGTTGC 1620 

CTGCCTCAGG TGTCCGCTAT 1680 

TTCTATTAGC CCTTCTCCAT 17 40 

CTAGTTCAAG TTCACCAGAC 1800 
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1 AGTCTGCACT GGAGCTGCCT 



61 ATGGCCTCCA GGCTGACCCT 
1MASR LTL 



121 TCCTCAAATC CAAATGCTAC 
21SSNP NAT 



181 GGCGAAGGGA AGGTCGCAAC 
41GEGK VAT 



2 41 GAGGTT T CCA GCTTGCCGAC 
61EVSS LPT 



3 01 AATACCACTG AT GAAC C C AC 
81NTTD EPT 



3 61 CAACCCACCC AACCAACTAC 
101 Q P T Q P T T 



421 TCCTTCTGCC CAGGACCTGT 
121 S F C P G P V 



481 GTGTTGGGGG ATGCTTTGGT 
141 V L G D A L V 



541 AAGAAGGTGG AGACCAACAT 
161 K K V E T N M 



601 GTCCTGCTCG GGGCTGGGCA 
181 V L L G A G Q 



661 AAGGACTTCA CCTGTGTCCA 
201 K D F T C V H 



721 GTCTCTCAGA TCTTCCACAG 
221 V S Q I F H S 



781 CGGACCCTGT ACAGCAGCAG 
241 R T L Y S S S 



841 CTCATCAACA CCTGGGTGGC 
261 LINT W V A 



901 CTGCCCTCCG ATACCCGCCT 
281 L P S D T R L 



Figure 14A 

GGTGACC AGA AGTTTGGAGT 



GCTGACCCTC CTGCTGCTGC 
LTL L L L L 



CAGCTCCAGC TCCCAGGATC 
S S S S Q D P 



AACAGTTATC TCCAAGATGC 
T V I S K M L 



AACCAACTCA ACAACCAATT 
T N S T T N S 



CACACAACCC AC C AC AG AGC 
T Q P T T E P 



CCAGCTCCCA ACAGATTCTC 
Q L P T D S P 



TACTCTCTGC TCTGACTTGG 
T L C S D L E 



AGATTTCTCC CTGAAGCTCT 
D F S L K L Y 



GGCCTTTTCC CCATTCAGCA 
A F S P F S I 



GAACACCAAA ACAAACCTGG 
N T K T N L E 



CCAGGCCCTG AAGGGCTTCA 
Q A L K G F T 



CCCAGACCTG GCCATAAGGG 
P D L A I R D 



CCCCAGAGTC CTAAGCAACA 
P R V L S N N 



CAAGAACACC AACAACAAGA 
K N T N N K I 



TGTCCTCCTC AATGC TAT C T 
V L L N A I Y 



CCGCTGACGT CGCCGCCCAG 60 



TGCTGGCTGG GG AT AG AGC C 12 0 
LAG D R A 20 



CAGAGAGTTT GCAAGACAGA 180 
E S L Q D R 40 



TATTCGTTGA ACCCATCCTG 240 
F V E P I L 60 



CAGCCACCAA AATAACAGCT 3 00 
A T K I T A 80 



CCACCACCCA ACCCACCATC 3 60 
T T Q P T I 100 



CTACCCAGCC CACTACTGGG 420 
T Q P T T G 120 



AGAGTCATTC AACAGAGGCC 480 
S H S TEA 140 



ACCACGCCTT CTCAGGAATG 540 
H A F S G M 160 



TCGCCAGCCT CCTTACCCAG 600 
A S L L T Q 180 



AG AGC AT C C T CTCTTACCCC 660 
S I L S Y P 200 



CGACCAAAGG TGTCACCTCA 72 0 
T K G V T S 220 



ACACCTTTGT G AATGC C T C T 7 80 
T F V N A S 240 



ACAGTGACGC CAACTTGGAG 840 
S D A N L E 260 



TCAGCCGGCT GC TAG AC AGT 9 00 
S R L L D S 280 



ACCTGAGTGC CAAGTGGAAG 960 
L S A K W K 300 
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Figure 14B 



9 61 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 102 0 
301TTFD PKK TRM EPFH FKN SVI 320 

1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321 KVPM MNS KKY PVAH FID QTL 340 

1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341KAKV GQL QLS HNLS L V I LVP 360 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 12 00 
361 QNLK HRL EDM EQAL SPS VFK 380 



12 01 GCCATCATGG AG AAAC TGG A GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 12 60 
381 AIME KLE MSK FQPT LLT LPR 400 

12 61 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAAT TGG A ATTCTTCGAT 132 0 
401 IKVT TSQ DML SIME KLE F FD 420 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG AC AGAGG AC C CAGATCTTCA GGTTTCTGCG 13 80 
421 FSYD LNL CGL TEDP DLQ VSA 440 

13 81 ATGCAGCACC AGACAGTGCT GGAAC TGAC A GAGAC TGGGG TGGAGGC GGC TGCAGCCTCC 1440 
441 MQHQ TVL ELT ETGV E A A A A S 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 
461 AISV ART LLV FEVQ QPF LFV 480 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560 
481 LWDQ QHK FPV FMGR VYD PRA 500 



1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 162 0 



1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 17 40 
1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAAC C TGAC A GACCAT 182 6 
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1 AGTCTGCACT GGAGCTGCCT 



61 ATGGCCTCCA GGCTGACCCT 
1MASR LTL 



121 TCCTCAAATC CAAATGCTAC 
21SSNP NAT 



181 GGC GAAGGGA AGGTCGCAAC 
41GEGK VAT 



241 GAGGTT T CCA GCTTGCCGAC 
61EVSS LPT 



3 01 AATACCACTG AT G AAC C C AC 
81NTTD EPT 



3 61 CAACCCACCC AAC C AAC T AC 
101 Q P T Q P T T 



421 TCCTTCTGCC CAGGACCTGT 
121 S F C P G P V 



481 GTGTTGGGGG ATGCTTTGGT 
141 V L G D A L V 



541 AAGAAGGTGG AGACCAACAT 
161 K K V E T N M 



601 GTCCTGCTCG GGGCTGGGCA 
181 V L L G A G Q 



6 61 AAGGACTTCA CCTGTGTCCA 
201 K D F T C V H 



721 GTCTCTCAGA TC TTCC AC AG 
221 V S Q I F H S 



781 CGGACCCTGT ACAGCAGCAG 
241 R T L Y S S S 



841 CTCATCAACA CCTGGGTGGC 
261 LINT W V A 



9 01 CTGCCCTCCG ATACCCGCCT 
281 L P S D T R L 



Figure 15A 

GGTGACCAGA AGTTTGGAGT 



GCTGACCCTC CTGCTGCTGC 
LTL L L L L 



CAGCTCCAGC TCCCAGGATC 
S S S S Q D P 



AACAGTTATC TCCAAGATGC 
T V I S K M L 



AACCAACTCA ACAACCAATT 
T N S T T N S 



CACACAACCC AC C AC AGAGC 
T Q P T T E P 



CCAGCTCCCA AC AG AT T C T C 
Q L P T D S P 



TACTCTCTGC TCTGACTTGG 
T L C S D L E 



AGATTTCTCC CTGAAGCTCT 
D F S L K L Y 



GGCCTTTTCC CCATTCAGCA 

A F S P F S I 

GAACACCAAA ACAAACCTGG 

N T K T N L E 



CCAGGCCCTG AAGGGC T TC A 
Q A L K G F T 



CCCAGACCTG GCCATAAGGG 
P D L A I R D 



CCCCAGAGTC CTAAGCAACA 
P R V L S N N 



CAAGAACACC AACAACAAGA 
K N T N N K I 



TGTCCTCCTC AATGCTATCT 
V L L N A I Y 



CCGCTGACGT CGCCGCCCAG 6 0 



TGCTGGCTGG GGATAGAGCC 12 0 
LAG D R A 20 



CAGAGAGTTT GCAAGACAGA 180 
E S L Q D R 40 



TATTCGTTGA ACCCATCCTG 240 
F V E P I L 60 



CAGCCACCAA AATAACAGCT 3 00 
A T K I T A 80 



CCACCACCCA ACCCACCATC 360 
T T Q P T I 100 



CTACCCAGCC C AC T AC TGGG 42 0 
T Q P T T G 120 



AGAGTC ATT C AACAGAGGCC 480 
S H S TEA 140 



ACCACGCCTT CTCAGCAATG 540 
H A F SAM 160 



TCGCCAGCCT CCTTACCCAG 600 
A S L L T Q 180 



AGAGCATCCT CTCTTACCCC 660 
S I L S Y P 200 



C G AC C AAAGG TGTCACCTCA 72 0 
T K G V T S 220 



ACACCTTTGT GAATGCCTCT 7 80 
T F V N A S 240 



ACAGTGACGC CAACTTGGAG 840 
S D A N L E 260 



TCAGCCGGCT GCTAGACAGT 9 00 
S R L L D S 280 



ACCTGAGTGC CAAGTGGAAG 9 60 
L S A K W K 300 
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Figure 15B 



961 ACAACATTTG ATCCCAAGAA AAC C AGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 102 0 
301 TTFD PKK TRM EPFH FKN SVI 320 



1021 AAAGTGC C C A TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 10 80 
•321 KVPM MNS KKY PVAH FID Q T L 340 

10 81 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341 KAKV GQL QLS HNLS L V I LVP 360 

1141 C AG AAC C TG A AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 12 00 
361 QNLK HRL EDM EQAL SPS VFK 380 

12 01 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 12 60 
381 AIME KLE MSK FQPT LLT LPR 400 

12 61 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAAT TGG A ATTCTTCGAT 13 2 0 
401 IKVT TSQ DML SIME KLE FFD 420 

13 21 TTTTCTTATG ACCTTAACCT GTGTGGGC TG AC AGAGG AC C CAGATCTTCA GGTTTCTGCG 13 80 
421 FSYD LNL CGL TEDP DLQ VSA 440 

13 81 ATGCAGCACC AG AC AGTGC T GGAAC TG AC A GAGAC TGGGG TGGAGGCGGC TGCAGCCTCC 1440 
441 MQHQ T V L ELT ETGV E A A AAS 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCATG 15 00 
461 AISV ART LLV FEVQ QPF LFM 480 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGT AT AT G A CCCCAGGGCC 1560 

481 LWDQ QHK FPV FMGR VYD PRA 500 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 162 0 

1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



16 81 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 180 0 



1801 TCTATAAATA AAACCTGACA GACCAT 182 6 
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1 TCCTCCACCT GCTGGCCCCT 
1 



61 GCCCTGTCCC TGGGGGGGAC 
9ALSL GGT 



121 TGGGAGTGTG AGCAGCATTC 
29WECE QHS 



181 CAGTGTGGGG GCATCCTGGT 
49QCGG I L V 



241 GACAATTACC AGCTCTGGCT 
69DNYQ L W L 



3 01 TTTGTTCATG TCAGTGAGAG 
89FVHV SES 



361 CACACCCGCC AAGCAGACGA 
109 H T R Q A D E 



421 CCTGCTGATA CCATCACAGA 
129 P A D T I T D 



481 GTGGGGAGCA CCTGTTTGGC 
149 V G S T C L A 



541 CCAGATGATC TCCAGTGTGT 
169 P D D L Q C V 



601 CACGTCCAGA AGGT G AC AG A 
189 H V Q K V T D 



661 ACCTGTGTGG GTGATTCAGG 
209 T C V G D S G 



721 TCATGGGGCT ACGTCCCTTG 
229 S W G Y V P C 



7 81 TCTTATGTGA AGTGGATCGA 
249 S Y V K W I E 



Figure 16 

GGACACCTCT GTCACCATGT 
M W 



TGGTGCTGCG CCCCCGATTC 
G A A P P I Q 



CCAGCCCTGG CAGGCGGCTC 
Q P W Q A A L 



GCACCGCCAG TGGGTGC TC A 
H R Q W V L T 



GGGTCGCCAC AACTTGTTTG 
G R H N L F D 



CTTCCCACAC CCTGGCTTCA 
F P H P G F N 



GGACTACAGC CACGACCTCA 
D Y S H D L M 



TGCTGTGAAG GTCGTGGAGT 
A V K V V E L 



TTCCGGCTGG GGCAGCATCG 
S G W G S I E 



GG AC C T C AAA ATCCTGCCTA 
D L K I L P N 



CTTCATGCTG TGTGTCGGAC 
F M L C V G H 



GGGCCCGCTG ATGTGTGATG 
GPL M C D G 



TGGCACCCCC AATAAGCCTT 
G T P N K P S 



GG AC AC CAT A GCGGAGAACT 
DTI A E N S 



GGTTCCTGGT TCTGTGCCTC 6 0 
F L V L C L 8 



AGTCCCGGAT TGTGGGAGGC 12 0 
SRI V G G 28 



TGTACCATTT CAGCACTTTC 180 
Y H F S T F 48 



CAGCTGCTCA TTGCATCAGC 240 
A A H CIS 68 



ACGACGAAAA CACAGCCCAG 3 00 
DEN T A Q 88 



AC ATGAGC C T CCTGGAGAAC 3 60 
M S L L E N 108 



TGCTGCTCCG CCTGACAGAG 42 0 
L L R L T E 128 



TGCCCACCGA GGAACCCGAA 480 
PTE E P E 148 



AACCAGAGAA TTTCTCATTT 540 
PEN F S F 168 



ATGATGAGTG CAAAAAAGCC 6 00 
DEC K K A 188 



AC C TGGAAGG TGGCAAAGAC 660 
LEG G K D 208 



GTGTGCTCCA AGGTGTCACA 720 
V L Q G V T 228 



CTGTCGCCGT CAGAGTGCTG 7 80 
V A V R V L 248 



CCTGAACGCC CAGCCCTGTC 840 
* 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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1 TCCTCCACCT GCTGGCCCCT 
1 

61 GCCCTGTCCC TGGGGGGGAC 
9 A L S L GGT 



121 TGGGAGTGTG AGCAGCATTC 
29WECE QHS 



181 CAGTGTGGGG GCATCCTGGT 
49QCGG ILV 



2 41 GACAATTACC AGCTCTGGCT 
69 D N Y Q LWL 



301 TTTGTTCATG TCAGTGAGAG 
89FVHV SES 



3 61 CACACCCGCC AAGCAGACGA 
109 H T R Q A D E 



421 CCTGCTGATA C C ATC AC AG A 
129 P A D T I T D 



481 GT GGGGAGC A CCTGTTTGGC 

149 V G S T C L A 

541 C C AG AT G ATC TCCAGTGTGT 

169 P D D L Q C V 



601 C AC GT C C AGA AGGTGACAGA 
189 H V Q K V T D 



661 ACCTGTGTGG GTGATTCAGG 
209 T C V G D S G 



721 TCATGGGGCT ACGTCCCTTG 
229 S W G Y V P C 



7 81 TCTTATGTGA AGTGG AT C G A 
249 S Y V K W I E 



Figure 17 

GGACACCTCT GTCACCATGT 
M W 



TGGTGCTGCG CCCCCGATTC 
G A A P P I Q 



CCAGCCCTGG CAGGCGGCTC 
Q P W Q A A L 



GCACCGCCAG TGGGTGC TC A 
H R Q W V L T 



GGGTCGCCAC AACTTGTTTG 
G R H N L F D 



CTTCCCACAC CCTGGCTTCA 
F P H P G F N 



GGACTACAGC CACGACCTCA 
D Y S H D L M 



TGCTGTGAAG GT C GTGGAGT 
A V K V V E L 



TTCCGGCTGG GGCAGCATCG 
S G W G S I E 



GGAC C TC AAA ATCCTGCCTA 
D L K I L P N 



CTTCATGCTG TGTGTCGGAC 
F M L C V G H 



GGGCCCGCTG ATGTGTGATG 
GPL M C D G 



TGGCACCCCC AATAAGCCTT 
G T P N K P S 



GGAC AC CAT A GCGGAGAACT 
DTI A E N S 



GGTTCCTGGT TCTGTGCCTC 60 
F L V L C L 8 



AGTCCCGGAT TGT GGGAGGC 12 0 
SRI V G G 28 



TGT AC CAT TT CAGCACTTTC 180 
Y H F S T F 48 



CAGCTGCTCA TTGCATCAGC 240 
A A H CIS 68 



ACGACGAAAA CACAGCCCAG 300 
DEN T A Q 88 



ACATGAGCCT CCTGGAGAAC 3 60 
M S L L E N 108 



TGCTGCTCCG CCTGACAGAG 42 0 
L L R L T E 128 



TGCCCACCGA GGAACCCGAA 480 
PTE E P E 148 



AACCAGAGAA TTTCTCATTT 540 
PEN F S F 168 



ATGATGAGTG CGAAAAAGCC 600 
DEC E K A 188 



ACCTGGAAGG TGGCAAAGAC 660 
LEG G K D 208 



GTGTGCTCCA AGGTGTCACA 72 0 
V L Q G V T 228 



CTGTCGCCGT CAGAGTGCTG 7 80 
V A V R V L 248 



CCTGAACGCC CAGCCCTGTC 840 

262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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Figure 18 

1 TCCTCCACCT GCTGGCCCCT GG AC AC C T C T GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
1 MWFLVLCL8 

61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 12 0 

9 A L S L GGT G A A PPIQ SRI VGG 28 

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 

29WECE QHS QPW Q A A L YHF STF 48 

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 

49QCGG ILV HRQ WVLT A A H CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG AC G AC GAAAA CACAGCCCAG 300 

69DNYQ LWL GRH NLFD DEN TAQ 88 

3 01 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360 

89FVHV SES FPH PGFN MSL LEN 108 

3 61 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 

109 HTRQ ADE DYS HDLM L L R LTE 128 



fee* ...... 

y] 421 CCTGCTGATA C CATC AC AG A TGC TGTGAAG GTCGTGGAGT TGCCCACCCA GGAACCCGAA 480 

m 12 9PADT ITD A V K VVEL PTQ EPE 148 

: i= : * — 

N 481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 

Ipl J 149 VGST C L» A SGW GSIE PEN FSF 168 

O 



541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CAAAAAAGCC 600 
169 PDDL QCV D L K ILPN DEC KKA 188 

6 01 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189 HVQK VTD FML CVGH LEG GKD 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 72 0 
209 TCVG DSG GPL MCDG VLQ GVT 228 

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780 
229 SWGY VPC GTP NKPS VAV RVL 248 

7 81 TCTTATGTGA AGTGGATCGA GGACACCATA GC GG AG AAC T CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS * 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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Figure 19 
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1 CTGTGCATGG CATCATCCTG 
1 MA S S W 



61 TTCCCTCAAA ATGCTACGGC 
19FPQN ATA 



121 GTGCTGCCNA C ATT TAT CAT 
39 V L P T FII 



181 CTGTTGGTCT TCCTCCTGCC 
59 L L V F LLP 



241 CTGGCAGCCT CTGATCTGGT 
79LAAS DLV 



3 01 AACCAGTTTA ACTGGCCTTT 
99NQFN WPF 



3 61 GCCAATTTGT TCATCAGCAT 
119 A N L F I S I 



421 CTGGTGCACC CTATGGCCAG 
139 L V H P MAS 



481 GTGCTCATCT GGGT TGTGGG 

159 V L I W V V G 

541 CAAGCCGTCC CAGATCTGAA 

179 Q A V P D L N 



601 CACTTTGCAA GGATTGTGGA 
199 H F A R I V E 



661 GTCTTCTTCA ACTACCACAT 
219 V F F N Y H I 

721 AG AGTGC NGG GGCCGAAGGA 
239 R V X G P K D 



781 TTCCTGGTCT GCTGGGCCCC 
259 F L V C WAP 



841 CAAGCAGTCC GAGGCTGCTT 
279 Q A V R G C F 



Figure 21A 

GCCCCCTCTA GAGCTCCAAT 
PPL E L Q S 

CTGTGACAAT GCTCCAGAAG 
C D N A P E A 

CTCCATCTGT TTCTTCGGCC 
SIC F F G L 



CCGGCGGCAA CTGAACGTGG 
R R Q L N V A 



GTTTGTCTTG GGCTTGCCCT 

F V L G L P F 

CGGAGCCCTC CTCTGCCGTG 

GAL L C R V 



CTTCCTGGTG GTGGCCATCA 
F L V V A I S 



C GGAAGGC AG CAGCGGCGGA 
G R Q Q R R R 



GGGCCTCTTG AGCATCCCCA 
G L L S I P T 



CATCACCGCC TGCATCNTGC 
I T A C I X L 



GTTAAATATT CTGGGTTTCC 

L N I L G F L 

CCTGGCCTCC CTGCGAACGC 

LAS L R T R 



TAGCAAGACC ACAGCGCTGA 
S K T TALI 



TTACCACTTC TTTGCCTTCC 
Y H F F A F L 



TTGGGAGGAC TTCATTGACC 
WED F I D L 



CCTCCAACCA GAGCCAGCTC 60 
S N Q SQL 18 

CCTGGGACCT GCTGCACAGA 12 0 
W D L L H R 38 



TCCTAGGGAA CCTTTTTGTC 180 
L G N L F V 58 



CAGAAATCTA CCTGGCCAAC 240 
E I Y LAN 78 



TCTGGGCAGA GAATATCTGG 3 00 
W A E N I W 98 

TCATCAANGG GGTCATCAAG 3 60 
I N G V I K 118 



GCCAGGACCG CTACCGCGTG 42 0 
Q D R Y R V 138 



GNCAGGCCCG GGTCACCTGC 480 
Q A R V T C 158 



CATTCCTGCT GCGATCCATC 540 
F L L RSI 178 



TCCTCCCCCA TGAGGCCTGG 600 
L P H E A W 198 



TCCTACCACT GGCTGCGATC 660 
L P L A A I 218 

GGGANGAGGT CAGCAGGACA 72 0 
E E V S R T 238 



TCCTCACGCT CGTGGTTGCC 780 
L T L V V A 258 



TGGAATTCTT ATTCCAGGTG 840 
E F L F Q V 278 



TGGGCCTGCA ATTGGCCAAC 900 
G L Q LAN 298 
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901 TTCTTTGCCT TCACTAACAG 
299 F F A F T N S 



9 61 TTCAGGACCA AGGTC TGGG A 
319 F R T K V W E 



1021 TCTTCATCCC ATAGGAAAGA 
339 S S S H R K E 



Figure 2 IB 

CTCCCTGAAT CCAGTAATTT 
S L N P V I Y 



AC T TT AT AAA CAATGCACCC 
L Y K Q C T P 



AATCTTCCAA CTTTTCTGGC 
I F Q L F W R 



ATGTCTTTGT GGGCCNGCTC 960 
V F V G X L 318 



CTAAAAGTCT TGCTCCAATA 102 0 
K S L API 338 



GGAATTAAAA CAGCATTGAA 1080 
N * 353 



1081 CC 1082 
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1 AATTCAGAGC CACCGCGGGC 



61 CAGTTCAGCT TTCAAAAAGA 



121 AG AAGG AC C C TGAGCCCCAG 



181 CAGATAGTAG GCTTTACGCC 
1 



241 CTCTCCCCAA ACATCTCCAC 
11LSPN 1ST 



3 01 CAAATTGTCC TTTGGGCAGC 
31QIVL W A A 



361 GTGGTAGTGA TGTGGATCAT 
51 V V V M WII 



421 CTGGTGAACC TGGCCTTCGC 
71LVNL A F A 



481 ACCTATGCTG TCCACAACGA 
91TYAV HNE 



541 TTNCCCATCG CCGCTGTCTT 
111 F P I A A V F 



601 TACATGGCCA TCATACATCC 
131 Y M A I I H P 



661 ATCTGTGTCA TNTGGGTCCT 
151 I C V I W V L 



721 ACAGAGACCA TGCCCAGCAG 
171 T E T M P S R 



7 81 ATT TAT GAGA AAGTGTACCA 
191 I Y E K V Y H 



841 GTGATTGGCT AT GC AT AC AC 
211 V I G Y A Y T 



9 01 GACTCCTCTG ACCGCTACCA 
231 D S S D R Y H 



Figure 22A 

AGGC GGGC AG TGCATCCAGA 



GTGCTGCCCA TAAAAAGCCT 



GCGCCAGCCA CAGGACTCTG 



TAGCTTCGAA ATGGATAACG 
M D N V 



TAACACCTCG GAACCCAATC 
NTS E P N Q 



TGCCTACACG GTCATTGTGG 
A Y T V I V V 



CTTAGCCCAC AAAAGAATGA 
L A H K R M R 



GGAGGCCTCC ATGGCTGCAT 
E A S M A A F 



ATGGT AC T AC GGCCTGTTCT 
W Y Y G L F Y 



CGCCAGTATC TACTCCATGA 
A S I Y S M T 



CCTCCAGCCC CGGCTGTCAG 
L Q P R L S A 



GGCTCTCCTG CTGGCCTTCC 
ALL L A F P 



AGTCGTGTGC ATGATCGAAT 
V V C M I E W 



CATC TGTGTG AC TGTGC TG A 
I C V T V L I 



CGTAGTGGGA ATC AC AC TAT 
V V G I T L W 



CGAGCAAGTC TCTGCCAAGC 
E Q V S A K R 



AGCGTTTATA TTCTGAGCGC 60 



TCCACCCTCC TGTCTGCTTT 120 



CTGCAGAGGG GGGTT GTGT A 180 



TCCTCCCGGT GGACTCAGAC 240 
L P V DSD 10 



AGTTCGTGCA ACCAGCCTGG 300 
F V Q PAW 30 



TGACCTCTGT GGTGGGCAAC 360 
T S V V G N 50 



GGACAGTGAC GAACTATTTT 420 
T V T N Y F 70 



TCAATACAGT GGTGAACTTC 480 
N T V V N F 90 



ACTGCAAGTT CCACAACTTC 540 
C K F H N F 110 



CGGCTGTGGC CTTTGATAGG 600 
A V A FDR 130 



CCACAGCCAC CAAAGTGGTC 660 
TAT K V V 150 



CCCAGGGCTA CTACTCAACC 72 0 
Q G Y Y S T 170 



GGCCAGAGCA TCCGAACAAG 7 80 
P E H P N K 190 



TCTACTTCCT CCCCCTGCTG 840 
Y F L P L L 210 



GGGCCAGTGA GATCCCCGGG 900 
A S E IPG 230 



GCAAGGTGGT CAAAATGATG 960 
K V V K M M 250 
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961 ATTGTCGTGG TGTGCACCTT 
251 I V V V C T F 



1021 CCCTACATCA ACCCAGATCT 
271 P Y I N P D L 



1081 ATGTGGC TGG CCATGAGCTC 
291 M W L A M S S 



1141 AGGTTCCGTC TGGGCTTCAA 
311 R F R L G F K 



12 01 TATGAGGGGC TGGAAATGAA 
331 Y E G L E M K 



12 61 GTCAGCCGCC TGGAGACCAC 
351 V S R L E T T 



1321 GACGGCCCCA AGGCCACACC 
371 D G P K ATP 



13 81 GACTCCAAGA C C ATGAC AG A 
391 D S K T M T E 



1441 GGGCCTTTGG CAGGTGCAGC 



1501 TCCCTTCATC TGGAACCATC 



1561 TGGGTTAGGG AAAACATTCC 



1621 CACCCTCATG CTGTGTGACT 



1681 AGGTCGGACC AGCTTTTCCT 



Figure 22B 

CGCCATCTGC TGGCTGCCCT 
A I C W L P F 



CTACCTGAAG AAGTTTATCC 
Y L K K F I Q 



CACCATGTAC AACCCCATCA 
T M Y N P I I 



GCATGCCTTC CGGTGCTGCC 

H A F R C C P 

ATCCACCCGG TATCTCCAGA 

S T R Y L Q T 



CATCTCCACA GTGGTGGGGG 
1ST V V G A 



CTCNTCCCTG GACCTGACCT 
SSL D L T S 



GAGCTTCAGC TTCTCCTCCA 
S F S F S S N 



CCCCACTGCC TTTGACCTGC 
AGAAAC AC C C TCACACTGGG 
ATCCTTGAGT CAAAAAATCT 



CAAACCAAAT CACTGAACTT 



CAAGAGCCCA ATGCATTCCA 



TCCACATCTT CTTCCTCCTG 102 0 
H I F F L L 270 



AGCAGGTCTA CCTGGCCATC 1080 
Q V Y LAI 290 



TCTACTGCTG CCTCAATGAC 1140 
Y C C L N D 310 



CCTTCATCAG CGCCGGCGAC 12 00 
F I S A G D 330 



CCCAGGGCAG TGTGTACAAA 12 60 
Q G S V Y K 350 

CCCACGAGGA GGAGCCAGAG 1320 
H E E E P E 370 



CCAACTGCTC TTCACGAAGT 13 80 
N C S S R S 390 



ATGTGCTCTC CTAGGCCACA 1440 
V L S * 407 



CTCCCTTCAT GCATGGAAAT 1500 
ACTTGCAAAA AGGGTCAGTA 1560 



CAATTCTTCC CTATCTTTGC 162 0 



TGCTGAGCCT GTAAAATAAA 1680 



TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 17 66 
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1 AGTCTGCACT GGAGCTGCCT 



61 ATGGCCTCCA GGCTGACCCT 
1MASR LTL 



121 TCCTCAAATC C AAATGC T AC 
21SSNP NAT 



181 GGCGAAGGGA AGGTCGCAAC 
41GEGK VAT 



241 GAGGTTTCCA GCTTGCCGAC 
61EVSS LPT 



3 01 AATACCACTG ATGAACCCAC 
81NTTD EPT 



361 CAACCCACCC AACCAACTAC 
101 Q P T Q P T T 



421 TCCTTCTGCC CAGGACCTGT 
121 S F C P G P V 



Figure 23A 

GGTGACCAGA AGTTTGGAGT 



GCTGACCCTC CTGCTGCTGC 
LTL L L L L 



CAGCTCCAGC TCCCAGGATC 
S S S S Q D P 



AACAGTTATC TCCAAGATGC 
T V I S K M L 



AACCAACTCA ACAACCAATT 
T N S T T N S 



CACACAACCC ACCACAGAGC 
T Q P T T E P 



CCAGCTCCCA ACAGATTCTC 
Q L P T D S P 



TACTCTCTGC TCTGACTTGG 
T L C S D L E 



CCGCTGACGT CGCCGCCCAG 60 



TGCTGGCTGG GGATAGAGCC 12 0 
LAG D R A 20 



CAGAGAGTTT GCAAGACAGA 180 
E S L Q D R 40 



TATTCGNTGA ACCCATCCTG 240 
F X E P I L 60 



CAGCCACCAA AATAACAGCT 3 00 
A T K I T A 80 



CCACCACCCA ACCCACCATC 3 60 
T T Q P T I 100 



CTACCCAGCC C AC T AC TGGG 42 0 
T Q P T T G 120 



AGAGTCATTC AACAGAGGCC 480 
S H S TEA 140 



481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGNAATG 540 
141 VLGD ALV DFS LKLY HAF SXM 160 



541 AAGAAGGTGG AGACCAACAT 
161 K K V E T N M 



601 GTCCTGCTCG GGGC TGGGC A 
181 V L L G A G Q 



661 AAGGACTTCA CCTGTGTCCA 
201 K D F T C V H 



721 GTCTCTCAGA TCTTCCACAG 
221 V S Q I F H S 



781 CGGACCCTGT ACAGCAGCAG 
241 R T L Y S S S 



841 CTCATCAACA CCTGGGTGGC 
261 LINT W V A 



9 01 CTGCCCTCCG ATACCCGCCT 
281 L P S D T R L 



GGCCTTTTCC CCATTCAGCA 
A F S P F S I 



GAAC AC C AAA ACAAACCTGG 
N T K T N L E 



CCAGGCCCTG AAGGGCTTCA 
Q A L K G F T 



CCCAGACCTG GCCATAAGGG 
P D L A I R D 



CCCCAGAGTC CTAAGCAACA 
P R V L S N N 



CAAGAACACC AACAACAAGA 
K N T N N K I 



TGTCCTCCTC AATGCTATCT 
V L L N A I Y 



TCGCCAGCCT CCTTACCCAG 600 
A S L L T Q 180 



AGAGCATCCT CTCTTACCCC 660 
S I L S Y P 200 



CGACCAAAGG TGTCACCTCA 72 0 
T K G V T S 220 



ACACCTTTGT GAATGCCTCT 7 80 
T F V N A S 240 



AC AGTGAC GC CAACTTGGAG 840 
S D A N L E 260 



TCAGCCGGCT GCTAGACAGT 900 
S R L L D S 280 



AC C TGAGTGC CAAGTGGAAG 960 
L S A K W K 300 
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961 ACAACATTTG ATCCCAAGAA AACCAGAATG 
301 TTFD PKK TRM 



1021 AAAGTGC C C A TGATGAATAG CAAGAAGTAC 
321 KVPM MNS KKY 



10 81 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC 
341 KAKV GQL QLS 



1141 CAGAACCTGA AACATCGTCT TGAAGACATG 
361 QNLK HRL EDM 

12 01 GCCATCATGG AG AAAC TGG A GATGTCCAAG 
381 AIME KLE MSK 



12 61 ATCAAAGTGA CGACCAGNCA GGATATGCTC 
401 IKVT TSQ DML 



13 21 TTTTCTTATG AC C TTAACC T GTGTGGGCTG 
421 FSYD L N L CGL 



13 81 ATGCAGCACC AG AC AGTGC T GGAAC TG AC A 
441 MQHQ TVL ELT 



1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC 
461 AISV ART LLV 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC 
481 L W D Q QHK FPV 



1561 T GAG AC C T GC AGGATCAGGT TAGGGCGAGC 



1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC 



1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA 



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT 
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GAACCCTTTC ACTTCAAAAA CTCAGTTATA 102 0 
EPFH FKN SVI 320 



CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
PVAH FID Q T L 340 



CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
H N L S L V I LVP 360 



GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 12 00 
EQAL SPS VFK 380 



TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
FQPT LLT L P R 400 



TCAATCATGG AG AAAT TGGA ATTCTTCGAT 132 0 
SIME KLE FFD 420 



AC AGAGGAC C CAGATCTTCA GGTTTCTGCG 13 80 

TEDP DLQ VSA 440 

GAGAC TGGGG TGGAGGCGGC TGCAGCCTCC 1440 

ETGV E A A A A S 460 



TTTGAAGTGC AGCAGCCCTT CCTCTTCNTG 1500 
FEVQ QPF L F X 480 



TTCATGGGGC GAGTATATGA CCCCAGGGCC 15 60 
FMGR VYD PRA 500 



GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 162 0 



CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



18 01 T C TAT AAAT A AAAC C TG AC A GACCAT 182 6 
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Figure 24 

1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
1 MWFLVLCL8 

61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 12 0 
9ALSL GGT GAA PPIQ SRI VGG 28 

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 
29WECE QHS QPW QAAL YHF STF 48 

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGC TC A CAGCTGCTCA TTGCATCAGC 240 
49QCGG I L V HRQ WVLT A A H CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG AC GACGAAAA CACAGCCCAG 3 00 
69DNYQ LWL GRH N L F D DEN TAQ 88 

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 3 60 
89FVHV SES FPH PGFN M S L L E N 108 

3 61 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 
109HTRQ ADE DYS HDLM LLR LTE 128 

421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCNA GGAACCCGAA 480 
129PADT ITD AVK VVEL PTX EPE 148 

481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 
149 VGST CLA SGW GSIE PEN FSF 168 

541 CCAGATGATC TCCAGTGTGT GGAC C T C AAA ATCCTGCCTA ATGATGAGTG CNAAAAAGCC 600 
169PDDL QCV DLK ILPN DEC XKA 188 

601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189HVQK VTD F M L CVGH LEG GKD 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 72 0 
209 TCVG DSG GPL MCDG VLQ GVT 228 

721 TC ATGGGGC T ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 7 80 
229 SWGY VPC GTP NKPS VAV RVL 248 

7 81 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS * 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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1 CTGTGCATGG CATCATCCTG 
1 MA S S W 



61 TTCCCTCAAA ATGCTACGGC 
19FPQN ATA 



121 GTGCTGCCGA CATTTATCAT 
39VLPT FII 



181 CTGTTGGTCT TCCTCCTGCC 
59LLVF LLP 



241 CTGGCAGCCT CTGATCTGGT 
79LAAS DLV 



3 01 AACCAGTTTA ACTGGCCTTT 
99NQFN WPF 



361 GCCAATTTGT TCATCAGCAT 
119 A N L F I S I 



421 CTGGTGCACC CTATGGCCAG 
139 L V H P MAS 



481 GTGCTCATCT GGGTTGTGGG 
159 V L I W V V G 

541 CAAGCCGTCC CAGATCTGAA 
179 Q A V P D L N 



6 01 CACTTTGCAA GGATTGTGGA 
199 H F A R I V E 



661 GTCTTCTTCA ACTACCACAT 
219 V F F N Y H I 

721 AGAGTGC GGG GGCCGAAGGA 
239 R V R G P K D 



781 TTCCTGGTCT GCTGGGCCCC 

259 F L V C WAP 

841 CAAGCAGTCC GAGGCTGCTT 

279 Q A V R G C F 



Figure 25A 

GCCCCCTCTA GAGCTCCAAT 
PPL E L Q S 



CTGTGACAAT GCTCCAGAAG 
C D N A P E A 



CTCCATCTGT TTCTTCGGCC 
SIC F F G L 



CCGGCGGCAA CTGAACGTGG 
R R Q L N V A 



GTTTGTCTTG GGCTTGCCCT 
F V L G L P F 



CGGAGCCCTC CTCTGCCGTG 
GAL L C R V 



CTTCCTGGTG GTGGCCATCA 
F L V V A I S 



C GGAAGGC AG CAGCGGCGGA 
G R Q Q R R R 



GGGCCTCTTG AGCATCCCCA 
G L L S I P T 



CATCACCGCC TGCATCCTGC 
I T A C I L L 



GTTAAATATT CTGGGTTTCC 
L N I L G F L 



CCTGGCCTCC CTGCGAACGC 
LAS L R T R 



TAGCAAGACC ACAGCGCTGA 
S K T TALI 



TTACCACTTC TTTGCCTTCC 
Y H F F A F L 



TTGGGAGGAC TTCATTGACC 
WED F I D L 



CCTCCAACCA GAGCCAGCTC 60 
S N Q SQL 18 

CCTGGGACCT GCTGCACAGA 12 0 
W D L L H R 38 



TCCTAGGGAA CCTTTTTGTC 180 
L G N L F V 58 



CAGAAATCTA CCTGGCCAAC 2 40 
E I Y LAN 78 



TCTGGGCAGA GAATATCTGG 300 
W A E N I W 98 

TCATCAATGG GGTCATCAAG 3 60 
I N G V I K 118 



GCCAGGACCG CTACCGCGTG 42 0 

Q D R Y R V 138 

GGCAGGCCCG GGTCACCTGC 480 

Q A R V T C 158 



CATTCCTGCT GCGATCCATC 540 
F L L RSI 178 

TCCTCCCCCA TGAGGCCTGG 600 
L P H E A W 198 



TCCTACCACT GGCTGCGATC 6 60 
L P L A A I 218 



GGGAGGAGGT CAGCAGGACA 72 0 
E E V S R T 238 



TCCTCACGCT CGTGGTTGCC 7 80 
L T L V V A 258 



TGGAATTCTT ATTCCAGGTG 840 
E F L F Q V 278 



TGGGCCTGCA ATTGGCCAAC 900 
G L Q LAN 298 
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Figure 25B 

901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 

961 TTCAGGACCA AGGTCTGGGA AC TT TAT AAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 102 0 
3 19 F R T K VWE LYK QCTP KSL API 338 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE IFQ LFWR N * 353 



1081 CC 1082 
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m 
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1 CTGTGCATGG CATCATCCTG 
1 M A S S W 



61 TTCCCTCAAA ATGCTACGGC 
19FPQN ATA 



121 GTGCTGCCGA CATTTATCAT 
39 V L P T FII 



181 CTGTTGGTCT TCCTCCTGCC 
59LLVF LLP 



241 CTGGCAGCCT CTGATCTGGT 
79LAAS DLV 



3 01 AACCAGTTTA ACTGGCCTTT 
99NQFN WPF 



3 61 GCCAATTTGT TCATCAGCAT 
119 A N L F I S I 

421 C TGGTGC AC C CTATGGCCAG 
139 L V H P MAS 



481 GTGCTCATCT GGGTTGTGGG 
159 V L I W V V G 



541 CAAGCCGTCC CAGATCTGAA 
179 Q A V P D L N 



601 CACTTTGCAA GGAT TGTGGA 
199 H F A R I V E 



661 GTCTTCTTCA ACTACCACAT 
219 V F F N Y H I 



721 AGAGTGCGGG GGCCGAAGGA 
239 R V R G P K D 



7 81 TTCCTGGTCT GCTGGGCCCC 
259 F L V C WAP 



841 CAAGCAGTCC GAGGCTGCTT 
279 Q A V R G C F 



Figure 26A 

GCCCCCTCTA GAGCTCCAAT 
PPL E L Q S 



CTGTGACAAT GCTCCAGAAG 
C D N A P E A 



CTCCATCTGT TTCTTCGGCC 
SIC F F G L 



CCGGCGGCAA CTGAACGTGG 
R R Q L N V A 



GTTTGTCTTG GGCTTGCCCT 
F V L G L P F 



CGGAGCCCTC CTCTGCCGTG 
GAL L C R V 



CTTCCTGGTG GTGGCCATCA 

F L V V A I S 

C GGAAGGC AG CAGCGGCGGA 

G R Q Q R R R 



GGGCCTCTTG AGCATCCCCA 
G L L S I P T 



CATCACCGCC TGCATCCTGC 
I T A C I L L 



GTTAAATATT CTGGGTTTCC 
L N I L G F L 



CCTGGCCTCC CTGCGAACGC 
LAS L R T R 



TAGCAAGACC AC AGCGC TGA 
S K T TALI 



TTACCACTTC TTTGCCTTCC 
Y H F F A F L 



TTGGGAGGAC TTCATTGACC 
WED F I D L 



CCTCCAACCA GAGCCAGCTC 60 
S N Q SQL 18 



CCTGGGACCT GCTGCACAGA 12 0 
W D L L H R 38 



TCCTAGGGAA CCTTTTTGTC 180 
L G N L F V 58 



CAGAAATCTA CCTGGCCAAC 2 40 
E I Y LAN 78 



TCTGGGCAGA GAATATCTGG 300 
W A E N I W 98 



TCATCAACGG GGTCATCAAG 3 60 
I N G V I K 118 



GCCAGGACCG CTACCGCGTG 42 0 
Q D R Y R V 138 

GACAGGCCCG GGTCACCTGC 480 
Q A R V T C 158 



CATTCCTGCT GCGATCCATC 540 
F L L RSI 178 



TCCTCCCCCA TGAGGCCTGG 600 
L P H E A W 198 



TCCTACCACT GGCTGCGATC 660 
L P L A A I 218 



GGGAGGAGGT CAGCAGGACA 72 0 
E E V S R T 238 



TCCTCACGCT CGTGGTTGCC 780 
L T L V V A 258 



TGGAATTCTT ATTCCAGGTG 840 
E F L F Q V 278 



TGGGCCTGCA ATTGGCCAAC 900 
G L Q LAN 298 
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Figure 26B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA AC TT TAT AAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 102 0 
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE IFQ LFWR N * 353 



1081 CC 1082 
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Figure 27A 

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL ELQS S N Q SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19FPQN ATA CDN APEA WDL LHR 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FII SIC FFGL LGN L F V 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ LNVA E I Y LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 3 00 
79LAAS DLV FVL GLPF W A E NIW 98 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
99NQFN WPF GAL LCRV I N G VIK 118 

3 61 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 42 0 
119ANLF ISI FLV VAIS QDR YRV 138 



421 CTGGTGCACC CTATGGCCAG 
139 L V H P MAS 



481 GTGCTCATCT GGGTTGTGGG 
159 V L I w V V G 



541 CAAGCCGTCC CAGATCTGAA 
179 Q A V P D L N 



C GGAAGGC AG CAGCGGCGGA 

G R Q Q R R R 

GGGCCTCTTG AGCATCCCCA 

G L L S I P T 

CATCACCGCC TGCATCGTGC 

I T A C I V L 



GGCAGGCCCG GGTCACCTGC 480 
Q A R V T C 158 

CATTCCTGCT GCGATCCATC 540 
F L L RSI 178 

TCCTCCCCCA TGAGGCCTGG 600 
L P H E A W 198 



601 CACTTTGCAA GGATTGTGGA 
199 H F A R I V E 



661 GTCTTCTTCA ACTACCACAT 
219 V F F N Y H I 



721 AGAGTGC GGG GGCCGAAGGA 
239 R V R G P K D 



GTTAAATATT CTGGGTTTCC 
L N I L G F L 



CCTGGCCTCC CTGCGAACGC 
LAS L R T R 

TAGCAAGACC ACAGCGCTGA 
S K T TALI 



TCCTACCACT GGCTGCGATC 660 
L P L A A I 218 

GGGAGGAGGT CAGCAGGACA 72 0 
E ' E V S R T 238 

TCCTCACGCT CGTGGTTGCC 7 80 
L T L V V A 258 



781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC 
259 FLVC WAP YHF 



841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC 
279 Q A V R GCF WED 



TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 

FAFL EFL F Q V 278 

TTCATTGACC TGGGCCTGCA ATTGGCCAAC 9 00 

F I D L GLQ LAN 298 
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901 TTCTTTGCCT TCACTAACAG 
299 F F A F T N S 



961 TTCAGGACCA AGGTC TGGG A 
319 F R T K V W E 



1021 TCTTCATCCC ATAGGAAAGA 
339 S S S H R K E 



Figure 27B 



CTCCCTGAAT CCAGTAATTT 
S L N P V I Y 



AC T TT AT AAA CAATGCACCC 
L Y K Q C T P 

AATCTTCCAA CTTTTCTGGC 
I F Q L F W R 



ATGTCTTTGT GGGCCGGCTC 960 
V F V G R L 318 

CTAAAAGTCT TGCTCCAATA 102 0 
K S L API 338 



GGAATTAAAA CAGCATTGAA 1080 
N * 353 



1081 CC 1082 
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1 CTGTGCATGG 
1 MA 



61 TTCCCTCAAA 
19 F P Q N 



121 GTGCTGC CGA 
39 V L P T 



181 CTGTTGGTCT 

59 L L V F 

241 CTGGCAGCCT 

79 L A A S 



3 01 AACCAGTTTA 

IK 99 N Q F N 

g 

3 61 GCCAATTTGT 

!!"! 119 A N L F 

|j] 421 CTGGTGCACC 

in 13 9 L V H P 

H 481 GTGCTCATCT 

jf{! 159 V L I W 

3 

Ul 541 CAAGCCGTCC 

CI 179 Q A V P 

H 

601 CACTTTGCAA 

199 H F A R 



661 GTCTTCTTCA 
219 V F F N 

721 AGAGTGCGGG 
239 R V R G 



7 81 TTCCTGGTCT 
259 F L V C 




Figure 2 8A 

CATCATCCTG GCCCCCTCTA GAGCTCCAAT 
SSW PPL ELQS 



ATGCTACGGC CTGTGACAAT GCTCCAGAAG 
ATA CDN APEA 



CATTTATCAT CTCCATCTGT TTCTTCGGCC 
FII SIC FFGL 



TCCTCCTGCC CCGGCGGCAA CTGAACGTGG 
LLP RRQ LNVA 



CTGATCTGGT GTTTGTCTTG GGCTTGCCCT 
DLV FVL GLPF 



ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG 
WPF GAL LCRV 



TCATCAGCAT CTTCCTGGTG GTGGC C AT C A 
ISI FLV VAIS 



CTATGGCCAG CGGAAGGCAG CAGCGGCGGA 
MAS GRQ QRRR 



GGGTTGTGGG GGGCCTCTTG AGCATCCCCA 
VVG GLL SIPT 



CAGATCTGAA CATCACCGCC TGCATCCTGC 
DLN ITA CILL 



GGATTGTGGA GTTAAATATT CTGGGTTTCC 
IVE LNI LGFL 



ACTACCACAT CCTGGCCTCC CTGCGAACGC 

YHI LAS LRTR 

GGCCGAAGGA TAGCAAGACC AC AGCGC TG A 

PKD SKT TALI 



GCTGGGCCCC TTACCACTTC TTTGCCTTCC 
WAP YHF FAFL 




CCTCCAACCA GAGCCAGCTC 60 
S N Q SQL 18 



CCTGGGACCT GCTGCACAGA 120 
W D L L H R 38 



TCCTAGGGAA CCTTTTTGTC 180 
L G N L F V 58 



CAGAAATCTA CCTGGCCAAC 240 
E I Y LAN 78 



TCTGGGCAGA GAATATCTGG 300 
W A E N I W 98 



TCATCAACGG GGTCATCAAG 3 60 
I N G V I K 118 



GCCAGGACCG CTACCGCGTG 42 0 
Q D R Y R V 138 



GGCAGGCCCG GGTCACCTGC 480 
Q A R V T C 158 



CATTCCTGCT GC GAT C CATC 540 
F L L RSI 178 



TCCTCCCCCA TGAGGCCTGG 600 
L P H E A W 198 



TCCTACCACT GGCTGCGATC 660 

L P L A A I 218 

GGGAGGAGGT C AGC AGGAC A '720 

K E V S R T 238 



TCCTCACGCT CGTGGTTGCC 7 80 
L T L V V A 258 



TGGAATTCTT ATTCCAGGTG 840 
E F L F Q V 278 



841 CAAGCAGTCC GAGGCTGCTT T TGGG AGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 QAVR GCF WED FIDL GLQ LAN 298 
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901 TTCTTTGCCT TCACTAACAG 
299 F F A F T N S 



961 TTCAGGACCA AGGTCTGGGA 
319 F R T K V W E 



1021 TCTTCATCCC ATAGGAAAGA 
339 S S S H R K E 



1081 CC 1082 



Figure 28B 



CTCCCTGAAT CCAGTAATTT 

S L N P V I Y 

AC T TT AT AAA CAATGCACCC 

L Y K Q C T P 



AATCTTCCAA CTTTTCTGGC 
I F Q L F W R 



ATGTCTTTGT GGGCCGGCTC 960 
V F V G R L 318 



CTAAAAGTCT TGCTCCAATA 102 0 
K S L API 338 



GGAATTAAAA CAGCATTGAA 1080 
N * 353 
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Figure 29A 

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTT GTGAGG AC T C CGTGCCCACC 60 

1MFSP WKI SMF LSVC EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 12 0 

21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG G AGTGGC TGG GCTGGCTCAA CACCATCCAG 180 

41TFAQ SKC PQV EWLG WLN T I Q 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AG AAC AT C T T TGTCCTCAGC 240 

61PPFL WVL FVL ATLE NIF VLS 80 



241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 3 00 

81VFCL HKS SCT VAEI YLG NLA 100 

301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 3 60 

101 AADL I L A CGL PFWA ITI SNN 120 

3 61 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420 

121 FDWL FGE TLC RVVN All SMN 140 



421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141 LYSS ICF LML VSID RYL A L V 160 



481 AAAACCATGT CCATGGGCCG GATGC GCGGC GTGCGC T GGG CCAAGCTCTA CAGCTTGGTG 540 
161 KTMS MGR MRG VRWA KLY SLV 180 

541 ATC TGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181 IWGC TLL LSS PMLV FRT MKE 200 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201 YSDE GHN VTA CVIS YPS LIW 220 



661 GAAGTGT T C A CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 72 0 

221 EVFT NML LNV VGFL LPL SVI 240 

721 ACCTTCTGCA C GATGC AGAT CATGCAGGTG CTGCGGAACA AC GAGATGC A GAAGTTCAAG 7 80 

241 TFCT MQI MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGC C ACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 

261 EIQT ERR A T V LVLV VLL LFI 280 



841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ 1ST FLDT LHR LGI 300 
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Figure 29B 

901 CTCTCCAGCT GCCAGGACGA GC GC AT CATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 

301 LSSC QDE RII DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 102 0 

321 AYSN SCL NPL VYVI VGK RFR 340 



1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGC TGC AG GTCAGAACCC 1080 

341 KKSW EVY QGV CQKG GCR SEP 360 

1081 AT T C AGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 

361 IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA AC GC C AGC AG GGCTGCTGTG 1200 

381 HKLQ DWA GSR Q* 391 

12 01 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGC C AAGGAG 12 60 



12 61 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 13 2 0 



Q 

a 

>;is* ...... 

yf 1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 

ii 

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 

m 

il! ...... 

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 

...... 

t) 1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAAC TGC 1560 

y 

tF : i ...... 



1561 CATC C AGC TT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 162 0 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 
1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGC CAT CTCCATCTTG AAGGAACTCA 186 0 
1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 192 0 
1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AG AC C AAGGT TCCAGCTCAA 1980 
19 81 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2 040 
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Figure 29C 

2 041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GT T T AC TATA AGGAAAAGAC 2160 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 222 0 
22 21 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 22 80 

22 81 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340 

23 41 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG AT ATT TC T AA TCGGTCTTGC CCAGAGGATC 2400 
2401 AC AGTGC TGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 252 0 
2 521 GAAGGTGGC C CAGTATGAGC C C T AG AAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580 
2 5 81 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2 640 
2 641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC C TGGAGGGC T AGAACCTGGA 270 0 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760 
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 282 0 
2 821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGC T AG AA C C TGGAGGGC TAGAACCTAG 2880 
2 8 81 AAGGGCTAGA AC C TGGAGGG CTGGAATCTG GAGAGCTAGA AC C TGGAGGG CTAGAACCTG 2 940 

2 941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3 000 

3 001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGC T A GAACCTGGAA GGGCTAGAAC 3 060 
3 061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 312 0 
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 318 0 
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 32 40 
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Figure 29D 

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 33 00 
33 01 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 33 60 
3361 AGAATGAAGT C AAAC TGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGC TGTGTT 342 0 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGC AC GTGAT GGTC TGAGAC TCTCTTAGGA 3480 
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTC TGAGGG GCAACTGAGT 3 540 
3 541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 
3 601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 
3 661 AATCATGTAA AC ATGTGTC T TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 372 0 
3721 AAAAAAAAAA AAA 3733 
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1 ATGTTCTCTC CCTGGAAGAT 
1MFSP WKI 



61 ACGGCCTCTT TCAGCGCCGA 
21TASF SAD 

121 ACCTTTGCCC AGAGCAAATG 
41TFAQ SKC 



181 CCCCCCTTCC TCTGGGTGCT 
61PPFL WVL 



241 GTCTTCTGCC TGCACAAGAG 
81VFCL HKS 

301 GCAGCAGACC TGATCCTGGC 
101 A A D L I L A 



361 TTCGACTGGC TCTTTGGGGA 
121 F D W L F G E 



421 CTGTACAGCA GCATCTGTTT 
141 L Y S S I C F 



481 AAAACCATGT CCATGGGCCG 
161 K T M S M G R 



541 ATCTGGGGGT GTACGCTGCT 
181 I W G C T L L 



6 01 TACAGCGATG AGGGCCACAA 
201 Y S D E G H N 



661 GAAGTGTTCA CCAACATGCT 
221 E V F T N M L 



721 ACCTTCTGCA CGATGCAGAT 

241 T F C T M Q I 

781 GAGATCCAGA CGGAGAGGAG 

261 E I Q T ERR 



841 ATCTGCTGGC TGCCCTTCCA 
281 I C W L P F Q 



Figure 3 OA 

ATCAATGTTT CTGTCTGTTC 
S M F L S V R 



CATGCTCAAT GTCACCTTGC 
M L N V T L Q 



CCCCCAAGTG GAGTGGC TGG 
P Q V E W L G 



GTTCGTGCTG GCCACCCTAG 
F V L A T L E 



CAGCTGCACG GTGGCAGAGA 
S C T V A E I 



CTGCGGGCTG CCCTTCTGGG 
C G L P F W A 



GACGCTCTGC CGCGTGGTGA 
T L C R V V N 



CCTGATGCTG GTGAGCATCG 
L M L V S I D 

GATGCGCGGC GTGCGCTGGG 
M R G V R W A 



CCTGAGCTCA CCCATGCTGG 
L S S P M L V 



CGTCACCGCT TGTGTCATCA 
VTA C V I S 



CCTGAATGTC GTGGGCTTCC 
L N V V G F L 



CATGCAGGTG CTGCGGAACA 
M Q V L R N N 



GGCCACGGTG CTAGTCCTGG 
A T V L V L V 



GATCAGCACC T T C C TGG AT A 
1ST F L D T 



GTGAGGACTC CGTGCCCACC 60 
EDS V P T 20 



AAGGGCCCAC TCTTAACGGG 12 0 
G P T L N G 40 



GCTGGCTCAA CACCATCCAG 180 
W L N T I Q 60 



AG AAC AT C T T TGTCCTCAGC 240 
N I F V L S 80 



TCTACCTGGG GAACCTGGCC 300 
Y L G NLA 100 



CCATCACCAT CTCCAACAAC 3 60 
I T I S N N 120 



ATGC C ATT AT CTCCATGAAC 42 0 
All S M N 140 

ACCGCTACCT GGCCCTGGTG 480 
R Y L A L V 160 



CCAAGCTCTA CAGCTTGGTG 540 
K L Y S L V 180 



TGTTCCGGAC CATGAAGGAG 600 
F R T M K E 200 



GCTACCCATC CCTCATCTGG 660 
Y P S L I W 220 



TGCTGCCCCT GAGTGTCATC 72 0 
L P L S V I 240 



AC GAG ATGC A GAAGTTCAAG 780 
E M Q K F K 260 



TTGTGCTGCT GCTATTCATC 840 
V L L L F I 280 



CGCTGCATCG CCTCGGCATC 900 
L H R L G I 300 
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Figure 3 OB 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GACGTAATCA CACAGATCGC CTCCTTCATG 960 
3 01 L S S C QDE R I I DVIT Q I A SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGC TGC AG GTCAGAACCC 1080 
341KKSW EVY QGV CQKG GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361IQME NSM GTL RTSI SVE R Q I 380 

1141 CACAAACTGC AGGAC TGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 12 00 
381 HKLQ DWA GSR Q* 391 

12 01 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGC C AAGGAG 12 60 

12 61 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 132 0 

13 21 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 13 80 
13 81 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 
1441 GCCTGCTCCT TCCCAGGAGT GGAGG AGGC C TGGGGGGAGG G AG AGGAGT G ACTGAGCTTC 1500 
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 
15 61 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 162 0 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG i6 80 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 
1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGCACTGT AGGC AAGAC C CAAGAAAGAG AAGGAGC C AT CTCCATCTTG AAGGAACTCA 1860 
1861 AAGACTCAAG TGGGAACGAC TGGGC ACTGC CACCACCAGA AAGCTGTTCG AC GAG AC GGT 192 0 
1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AG AC C AAGGT TCCAGCTCAA 1980 
1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2 040 
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Figu 

2 041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA 
2161 ACTGAGGTCT AGAAATAGCT C C GTGGAGC A 
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC 

22 81 CCACCCTGAG GCCCCAACCG CCACACACAC 

23 41 AAAGTC T GAT TTGTGATGAG GCAGAGGAAG 
2401 AC AGTGC TGA GACCCCCCAC CACCAGCCGG 
2 461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG 
2 521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG 
2 5 81 TCGGC AGTGC CAGGGCAGCA CTCATTCACT 
2 641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG 
27 61 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG 
2 821 AGGGC TGGAA TCTGGAGAGC TAGAACCTGG 
2 881 AAGGGCTAGA AC C TGGAGGG CTGGAATCTG 

2 941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT 
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC 

3 061 CTGTAGAGCT AG AAC AT GGA GAGCTAGAAC 
3121 C TGGAGGG AA TGAAC CTGGA GGGCTAGAAC 
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT 



:e 30C 

TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 
ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160 
GAATCAGTAT TGGGAGCCGG TGGC GGTGTG 222 0 
TCATTGGCTC CCTTCCACCT GTCATTCCCA 22 80 
AGGAGC ATT T GGAGAGAAGG CCATGTCTTC 2340 
AT AT TT C T AA TCGGTCTTGC CCAGAGGATC 2400 
T AC C TGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
GGATTGTTCC TGTCAATCAA TGGTTTATTG 2 52 0 
TGTGAAAAGG AATGGCAATG GTGTTCACCA 2 580 
TGATAAATGA ATATTTATTA GCTGGTTGGA 2 640 
G AAC TAG AAC C T GG AGGGC T AGAACCTGGA 27 00 
GAGGGGCTAG AACCTAGAGA AGCTAAAACC 27 60 
AGGGC TGGAA TCTGAAGGGC TAGAACCTGG 2 82 0 
AGGGC TAGAA CCTGGAGGGC TAGAACCTAG 2 880 
GAGAGCTAGA AC C TGGAGGG CTAGAACCTG 2 940 
GGAGGGC T AG AAC C TGGC AG GT TAG AAC C T 3 000 
TGGAGGGC T A GAAC C TGGAA GGGCTAGAAC 3060 
CCGGCAGGCT AG AAC C TGGC AAGCTAGAAC 312 0 
CTGGAGAATG AG AAAAAT T T ACATGGCAAA 3180 
GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 32 40 
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Figure 30D 

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3 3 00 
33 01 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360 
3361 AGAATGAAGT C AAAC TGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 342 0 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTC TGAGAC TCTCTTAGGA 3480 
3 481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540 
3 541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 
3 601 GTGAAAG AC T CAATGAGCTG TTATGTTGTA AACAGGAAGC AT T TC AC AT C CAAACGAGAA 3 6 60 
3 661 AATCATGTAA AC ATGTGTC T TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 372 0 
3721 AAAAAAAAAA AAA 3733 
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Figure 31A 

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60 
1MFSP WKI SMF LSVR EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGC C C AC TCTTAACGGG 12 0 
21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
41TFAQ SKC PQV EWLG WLN T I Q 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61PPFL WVL FVL ATLE NIF VLS 80 

241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 3 00 
81 V F C L HKS SCT VAEI YLG NLA 100 

3 01 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 3 60 
101 AADL I L A CGL PFWA ITI SNN 120 



| ...... 

f\ 361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 42 0 

121 FDWL FGE TLC RVVN All SMN 140 



ji M 

y;i ...... 

jiff 421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGC AT C G ACCGCTACCT GGCCCTGGTG 480 

fjj 141 LYSS ICF LML VSID RYL A L V 160 

S 

I-? 481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 

jfjj 161 KTMS MGR MRG VRWA KLY SLV 180 

¥ z \ 

Ul 541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 

Q 181 IWGC TLL LSS PMLV FRT MKE 200 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 6 60 

201 YSD E GHN VTA CVIS YPS LIW 220 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 72 0 

221 EVFT NML LNV VGFL LPL SVI 240 

721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA AC GAGATGC A GAAGTTCAAG 7 80 

241 TFCT MQI MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 

261 EIQT ERR A T V LVLV VLL LFI 280 



841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ 1ST FLDT LHR LGI 300 
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Figure 3 IB 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 
301 LSSC QDE RII DVIT Q I A SFM 320 



961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 102 0 
321 AYS N SCL NPL VYVI VGK RFR 340 



1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG AGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKE GCR SEP 360 



1081 ATT C AG ATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361 IQME NSM GTL RTSI SVE R Q I 380 



1141 CACAAACTGC AGGAC TGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 12 00 
381 HKLQ DWA GSR Q* 391 



1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 



12 61 AC AT C T ATGC ACGACCTTGG GAAATGAGTT 
1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT 

13 81 GCCAAGGACT CCAAAATCAC AACAGCATTA 
1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC 
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG 
1561 CATCCAGCTT TGGTGCAATG GC TGAGTGC A 



GATGTCTCCG GTAAAACACC GGAGACTAAT 132 0 

GGCTGTGAGG ATGGGGTGAA CTCACGCACA 13 80 

CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 

TGGGGGGAGG GAGAGGAGTG AC TGAGC TT C 1500 

CAAGACAACT TAGATCTCCA GGAGAAC TGC 1560 

CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 162 0 



1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA AT AAAGGT T A AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 



1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CAT AT TGAGC ACTTGCTGTA TATGCAGTAT 1800 



1801 TGAGC ACTGT AGGCAAGACC CAAGAAAGAG 
1861 AAGACTCAAG TGGGAAC GAC TGGGC ACTGC 
1921 CGAGCAGGGT GC TGTGGGTG ATATGGACAG 
1981 CCAATAACTA TT GC AC AAC C ACCTGTCCCT 



AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 
CACCACCAGA AAGCTGTTCG ACGAGACGGT 192 0 
CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 
GCCTCAGTTC CCTTTTATGT AACATGAAGT 2 040 
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Figure 31C 

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 
2101 GTACATGTGA GGCATCATTA C GC AG AC GT A ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 222 0 
22 21 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 22 80 

22 81 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGC AT TT GGAGAGAAGG CCATGTCTTC 2340 

23 41 AAAGTC TGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC C C AG AGGATC 2400 
2401 AC AGTGC TGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
2 461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 252 0 

P 

f*\ 2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580 

ii n 

€} 2 581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA AT AT TT ATT A GCTGGTTGGA 2 640 

P ^ 

2 641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700 

j*4 ...... 

jhj 27 01 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760 

O 

2761 TGAGCTAGAA GC TGGAGGAC TAGAACCTGG AGGGC TGGAA TCTGAAGGGC TAGAACCTGG 2 82 0 

a 

m * 

2 821 AGGGC TGGAA TCTGGAGAGC TAGAACCTGG AGGGC T AG AA CCTGGAGGGC TAGAACCTAG 2880 
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA AC CTGGAGGG CTAGAACCTG 2940 

2 941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GG AGGGC TAG AAC CTGGC AG GTTAGAACCT 3000 

3 001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGC T A GAAC C TGGAA GGGCTAGAAC 3 060 
3 061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 312 0 
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AG AAAAAT T T ACATGGCAAA 3180 
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3 240 
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Figure 3 ID 

3 241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 33 00 

33 01 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 33 60 

3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 342 0 

3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 

3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTC TGAGGG GCAACTGAGT 3 540 

3 541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3 600 

3 601 GTGAAAG AC T CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3 660 
3 661 AATCATGTAA AC ATGTGTC T TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 372 0 
3721 AAAAAAAAAA AAA 3733 
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Figure 32A 

1 CGCCCAACCCAAGTTCAAAGGCTGATAAGAGAGAAAATCTCATGAGGAGGTTTTAGTCTA 

61 GGGAAAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGATGTCAAGCTCTTCCTG 
1 M S S S S W 

121 GCTCCTTCTCAGCCTTGTTGCTGTAACTGCTGCTCAGTCCACCATTGAGGAACAGGCCAA 



7 LLLSLVAVTAAQSTIE 



E Q A K 



181 GACATTTTTGGACAAGTTTAACCACGAAGCCGAAGACCTGTTCTATCAAAGTTCACTTGC 

27 TFLDKFNHEAEDLFYQSSLA 

241 TTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATGAATAATGCTGG 

47 SWNYNTNI TEENVQNMNNAG 

3 01 GGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACA 

67 DKWSAFLKEQSTLAQMYPLQ 

3 61 AGAAATTCAGAATCTCACAGTCAAGCTTCAGCTGCAGGCTCTTCAGCAAAATGGGTCTTC 

87 EIQNLTVKLQLQALQQNGSS 

421 AGTGCTCTCAGAAGACAAGAGCAAACGGTTGAACACAATTCTAAATACAATGAGCACCAT 

107 VLSEDKSKRLNTILNTMSTI 

481 CTACAGTACTGGAAAAGTTTGTAACCCAGATAATCCACAAGAATGCTTATTACTTGAACC 

127 YSTGKVCNPDNPQECLLLEP 

541 AGGTTTGAATGAAATAATGGC AAACAGTTTAGACTACAATGAGAGGCTCTGGGCTTGGGA 

147 GLNEIMANSLDYNERLWAWE 

601 AAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTT 

167 SWRSEVGKQLRPLYEEYVVL 

661 GAAAAATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGA 

187 KNEMARANHYEDYGDYWRGD 

721 CTATGAAGTAAATGGGGTAGATGGCTATGACTACAGCCGCGGCCAGTTGATTGAAGATGT 

207 YEVNGVDGYDYSRGQL IEDV 

7 81 GGAACATACCTTTGAAGAGATTAAACCATTATATGAACATCTTCATGCCTATGTGAGGGC 

227 EHTFEEIKPLYEHLHAYVRA 

841 AAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGATGCCTCCCTGCTCATTT 

247 KLMNAYPSYISPIGCLPAHL' 
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Figure 32B 



m 



9 01 GCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCTTTGACAGTTCCCTTTGG 9 60 
267 LGDMWGRFWTNLYSLTVPFG 286 



961 ACAGAAACCAAACATAGATGTTACTGATGC AATGGTGGACCAGGCCTGGGATGC ACAGAG 102 0 

287 QKPNIDVTDAMVDQAWDAQR 306 

1021 AATATTCAAGGAGGCCGAGAAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGG 108 0 

307 I FKEAEKFFVSVGLPNMTQG 326 

1081 ATTCTGGGAAAATTCCATGCTAACGGACCCAGGAAATGTTCAGAAAGCAGTCTGCCATCC 114 0 

327 FWENSMLTDPGNVQKAVCHP 346 

1141 CAC AGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATGTGC AC AAAGGTGACAAT 1200 

347 TAWDLGKGDFRI LMCTKVTM 366 



ip*5 ...... 

j;j 12 01 GGACGACTTCCTGACAGCTC ATCATGAGATGGGGCATATCCAGTATGATATGGCATATGC 12 6 0 

M 367 DDFLTAHHEMGH I'QYDMAYA 386 

>b : | ... 

«J3 12 61 TGCACAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGA 132 0 

U] 387 AQPFLLRNGANEGFHEAVGE 406 



1321 AATCATGTCACTTTCTGCAGCCAC ACCTAAGCATTTAAAATCCATTGGTCTTCTGTCACC 13 8 0 
407 IMSLSAATPKHLKSIGLLSP 426 



|ij 13 81 CGATTTTCAAGAAGACAATGAAACAGAAATAAACTTCCTGCTCAAACAAGCACTCACGAT 144 0 

p 427 DFQEDNETEINFLLKQALTI 446 

H 

1441 TGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGGTGGATGGTCTTTAAAGG 1500 

447 VGTL PFTYMLEKWRWMVFKG 466 

1501 GGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGG 1560 

467 EI PKDQWMKKWWEMKREIVG 486 

1561 GGTGGTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGT 162 0 

487 VVEPVPHDETYCDPASLFHV 506 

1621 TTCTAATGATTACTCATTCATTCGATATTACACAAGGACCCTTTACCAATTCCAGTTTCA 168 0 

507 SNDYSFIRYYTRTLYQFQFQ 526 

1681 AGAAGCACTTTGTCAAGCAGCTAAACATGAAGGCCCTCTGCACAAATGTGACATCTCAAA 17 4 0 

527 EALCQAAKHEGPLHKCDI SN 546 
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Figure 32C 



1741 CTCTAC AGAAGCTGGACAGAAACTGTTCAATATGCTGAGGCTTGGAAAATCAGAACCCTG 
547 STEAGQKLFNMLRLGKSEPW 

1801 GACCCTAGCATTGGAAAATGTTGTAGGAGCAAAGAACATGAATGTAAGGCCACTGCTCAA 
567 TLALENVVGAKNMNVRPLLN 



1861 CTACTTTGAGCCCTTATTTACCTGGCTGAAAGACCAGAACAAGAATTCTTTTGTGGGATG 

587 YFEPLFTWLKDQNKNSFVGW 

1921 GAGTACCGACTGGAGTCCATATGC AGACC AAAGCATCAAAGTGAGGATAAGCCTAAAATC 

607 STDWSPYADQSIKVRISLKS 

1981 AGCTCTTGGAGATAAAGCATATGAATGGAACGACAATGAAATGTACCTGTTCCGATCATC 

627 ALGDKAYEWNDNEMYLFRSS 



2 041 TGTTGC ATATGCTATGAGGCAGTACTTTTTAAAAGTAAAAAATC AGATGATTCTTTTTGG 

647 VAYAMRQYFLKVKNQMI LFG 

2101 GGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAAGAATCTCCTTTAATTTCTTTGTCAC 

667 EEDVRVANLKPRI SFNFFVT 

2161 TGCACCTAAAAATGTGTCTGATATCATTCCTAGAACTGAAGTTGAAAAGGCCATCAGGAT 

687 APKNVSDI I PRTEVEKAIRM 

2221 GTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGG 

707 SRSRINDAFRLNDNSLEFLG 

22 81 GATACAGCCAACACTTGGACCTCCTAACCAGCCCCCTGTTTCCATATGGCTGATTGTTTT 

727 IQPTLGPPNQPPVSIWLIVF 



2 3 41 TGGAGTTGTGATGGGAGTGATAGTGGTTGGCATTGTCATCCTGATCTTC ACTGGGATC AG 
747 GVVMGVIVVGIVILIFTGIR 

2401 AGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAGAAAATCCTTATGCCTCCATCGATAT 
767 DRKKKNKARSGENPYASIDI 

2461 TAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAGACCTCCTTTTA 
787 SKGENNPGFQNTDDVQTSF * 



2 521 GAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGTTGTATGTAAATGTTAATTTCATGG 



2 5 81 TATAGAAAATATAAGATGATAAAGATATCATTAAATGTCAAAACTATGACTCTGTTCAGA 
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Figure 32D 

2 641 AAAAAAATTGTCCAAAGACAACATGGCCAAGGAGAGAGCATCTTCATTGACATTGCTTTC 27 00 

2701 AGTATTTATTTCTGTCTCTGGATTTGACTTCTGTTCTGTTTCTTAATAAGGATTTTGTAT 27 6 0 

2 7 61 TAGAGTATATTAGGGAAAGTGTGTATTTGGTCTCAC AGGCTGTTCAGGGATAATCTAAAT 2 82 0 

2 821 GTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGATATATCATTGGAGC AAGTGTTG 2 88 0 

2 881 GATCTTGTATGGAATATGGATGGATCACTTGTAAGGACAGTGCCTGGGAACTGGTGTAGC 2940 

2 941 TGCAAGGATTGAGAATGGCATGCATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGA 3 000 

3 001 CATGCTTTCTTCACAGTAACTCAGTTCAAGTACTATGGTGATTTGCCTACAGTGATGTTT 3 060 
3 061 GGAATCGATCATGCTTTCTTCAAGGTGACAGGTCTAAAGAGAGAAGAATCCAGGGAACAG 312 0 
3121 GTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAACATCTCCCTGAC AACACAA 3180 
3181 AACTAGAGCCAGGGGCCTCCGTGAACTCCCAGAGCATGCCTGATAGAAACTCATTTCTAC 3240 
3241 TGTTCTCTAACTGTGGAGTGAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGT 33 00 
3 3 01 ACCCAGTCTCTTAAATCTTTTGTATTTGCTC ACAGTGTTTGAGCAGTGCTGAGCACAAAG 3360 
33 61 CAGACACTCAATAAATGCTAGATTTACACACTCAAAAAAAAAAAA 3 405 
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Figure 33A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60 

1MHLIDYLLLLLVGLLALSHG 20 

6 1 C AGC TGC AC GTTGAGC ATGATGGTGAGAGTTGC AGTAAC AGC TC C C AC C AGC AGATTC TG 12 0 

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41 ETGEGSPSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61 RFYYLIASETPGKNIFFS PL 80 

241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 3 00 

81SISAAYAMLSLGACSHSRSQ 100 

3 01 atccttgagggcctgggcttcaacctcaccgagctgtctgagtccgatgtccataggggc 3 60 

101 ileglgfnltelse'sdvhrg 120 

3 61 ttccagcacctcctgcac actctc aacctccccggccatgggctggaaac acgcgtgggc 42 0 

121 FQHLLHTLNLPGHGLETRVG 140 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 

481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

161 AVYEAKLFHTNFYDTVGTIQ 180 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 

181 LINDHVKKETRGKIVDLVSE 200 

6 01 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 
201 LKKDVLMVLVNY IYFKALWE 220 

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 72 0 

221 KPFI SSRTTPKDFYVDENTT 240 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 7 80 

241 VRVPMMLQDQEHHWYLHDRY 260 

7 81 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 L PC SVLRMDYKGDATVF'F I L 280 
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Figure 33B 



841 CCTAACCAAGGC AAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 
281 PNQGKMRE I EEVLTPEMLMR 



901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 
301 WNNLLRKRNFYKKLELHLPK 



9 61 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 
321 FSISGSYVLDQILPRLGFTD 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACC AAAC AGCAAAAACTGGAGGCATCC 

341 LFSKWADLSGITKQQKLEAS 

1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 

361 KSFHKATLDVDEAGTEAAAA 

1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 

381 TTFAIKFFSAQTNRHILRFN 

12 01 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 

401 RPFLVVIFSTSTQSVLFLGK 




12 61 GTCGTCGACCCCACGAAACCATAG 12 84 
421 VVDPTKP* 428 
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Figure 34A 

1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 6 0 

1MHLIDYLLLLLVGLLALSHG 20 

6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 12 0 

21 QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41 ETGEGSPSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61 RFYYLIASETPGKNIFFSPL 80 

2 41 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 3 00 

81 SI SAAYAMLSLGACSHSRSQ 100 

^ 3 01 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 3 60 

M 101 ILEGLGFNLTELSESDVHRG 120 

Ci 

%if ...... 

1J? 3 61 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 42 0 



IS"! ; 
!i : 



121 FQHLLHTLNLPGHGLETRVG 140 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 

481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

161 AVYEAKLFHTNFYDTVGTIQ 180 

541 CTTATC AACGACCACGTC AAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTC AGTGAG 600 

181 LINDHVKKETRGKIVDLVSE 200 



6 01 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 
201 LKKDVLMVLVNYIYFKALWE 220 



6 61 AAACCATTC ATTTCCTCAAGGACC ACTCCC AAAGACTTTTATGTTGATGAGAAC ACAACA 72 0 

221 KPFISSRTTPKDFYVDENTT 240 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 7 80 

241 VRVPMMLQDQEHHWYLHDRY 260 



781 
261 



TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
LPCSVLRMDYKGDATVFFIL 280 



D0053 NP 

Figure 34B 

841 C C T AAC C AAGGC AAAATG AGGG AG ATTG AAG AGGTT C TG AC T C C AG AG ATGC T AATGAGG 9 00 

281 PNQGKMREIEEVLTPEMLMR 300 

9 01 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATGTTCCCAAG 9 60 

301 WNNLLRKRNFYKKLELHLPK 320 

9 61 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 102 0 

321 FSISGSYVLDQILPRLGFTD 340 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 108 0 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCAC AAGGCC ACCTTGGACGTGGATGAGGCTGGC ACCGAGGCTGCAGC AGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 

H 1141 ACCACGTTCGCGATC AAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 12 0 0 

P 381 TTFAIKFFSAQTNRHILRFN 400 

q 

| ]1 12 01 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260 

401 RPFLVVIFSTSTQSVLFLGK 420 



12 61 GTCGTCGACCCCACGAAACCATAG 12 84 



* 421 VVDPTKP* 428 

m 
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Figure 35A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60 
1MHLIDYLLLLLVGLLALSHG 20 



6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 12 0 

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41 ETGEGS PSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 2 40 

61RFYYLIASE TPGKNIFFSPL 80 

241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 3 00 

81 SI SAAYAMLSLGACSHSRSQ 100 

iUsfe ...... 

* 3 01 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 3 60 

W 101 ILEGLGFNLTELSESDVHRG 120 

>h.l 
t'\ 

■izsf ...... 

|j| 3 61 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420 

lJIJ 121 FQHLLHTLNLPGHGLETRVG 140 

il f i 

01 ...... 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

! £ 141 SALFLSHNLKFLAKFLNDTM 160 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 
:-| 161 AVYEAKLFHTNFYDTVGTIQ 180 



541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGCGAG 600 
181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 6 60 
201 LKKDVLMVLVNYIYFKALWE 220 



6 61 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 72 0 
221 KPFI SSRTTPKDFYVDENTT 240 



721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 78 0 
241 VRVPMMLQDQEHHWYLHDRY 260 



7 81 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 L PC SVLRMDYKGDATVFF I L 280 
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Figure 35B 

.-••** 

841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 9 00 

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 9 60 

301 WNNLLRKRNFYKKLELH. LPK 320 

961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 102 0 

321 FSISGSYVLDQILPRLGFTD 340 

• 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCAC AAGGCCACCTTGGACGTGGATGAGGCTGGC ACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACC ACGTTCGCGATC AAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 12 00 

381 TTFAIKFFSAQTNRHILRFN 400 

12 01 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 12 6 0 

401 RPFLVVIFSTSTQSVLFLGK 420 

12 61 GTCGTCGACCCCACGAAACCATAG 12 84 
421 VVDPTKP* 428 
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Figure 36A 

1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60 

1MHLIDYLLLLLVGLLALSHG 20 

6 1 CAGCTGC ACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACC AGCAGATTCTG 12 0 

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41 ETGEGSPSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61 RFYYLIASETPGKNIFFSPL 80 

241 AGCATCTCGGCGGCCTACGCC ATGCTTTCCCTGGGGGCCTGCTCACAC AGCCGC AGCCAG 3 00 

81 S I SAAYAMLSLGACSHSRSQ 100 

3 01 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 3 60 

101 ILEGLGFNLTELSESDVHRG 120 

3 61 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420 

tfl 121 FQHLLHTLNLPGHGLETRVG 140 

in 

£n ...... 

m 421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

[ b . 141 SALFLSHNLKF L" AKFLNDTM 160 

hi 

W 481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

W 161 AVYEAKLFHTNFYDTVGTIQ 180 

P 

fa* ...... 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGNGAG 600 

181 LINDHVKKETRGKIVDLVSE 200 

601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 

201 LKKDVLMVLVNYIYFKALWE 220 

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTNTATGTTGATGAGAACACAACA 720 

221 KPFI SSRTTPKDFYVDENTT 240 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 7 80 

241 VRVPMMLQDQEHHWYLH. DRY 260 



a 



7 81 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 LPCSVLRMDYKGDATVFFIL 280 
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Figure 36B 



'•is zf 



jPj! 



841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 

281 PNQGKMREIEEVLTPEMLMR 300 

9 01 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960 

301 WNNLLRKRNFYKKLELHLPK 320 

9 61 TTCTCC ATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTC ACGGAT 102 0 

321 FSISGSYVLDQILPRLGFTD 340 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGC ATCACCAAACAGCAAAAACTGGAGGCATCC 108 0 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCACAAGGCC ACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACC AATCGCCACATCCTGCGATTCAAC 12 00 

381 TTFAIKFFSAQTNRHILRFN 400 



U1 12 01 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 12 6 0 

y;f 401 RPFLVVIFSTSTQSVLFLGK 42 0 



12 61 GTCGTCGACCCCACGAAACCATAG 12 84 
421 VVDPTKP* 428 



D0053NP 

Figure 37A 

1 CGCCCAACCCAAGTTCAAAGGCTGATAAGAGAGAAAATCTCATGAGGAGGTTTTAGTCTA 6 0 

6 1 GGGAAAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGATGTCAAGCTCTTCCTG 12 0 

1 MSSSSW6 

121 GCTCCTTCTCAGCCTTGTTGCTGTAACTGCTGCTCAGTCCACCATTGAGGAACAGGCCAA 180 

7 LLLSLVAVTAAQSTIEEQAK 26 

181 GACATTTTTGGACAAGTTTAACCACGAAGCCGAAGACCTGTTCTATCAAAGTTCACTTGC 24 0 

27 TFLDKFNHEAEDLFYQSSLA 46 

241 TTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATGAATAATGCTGG 3 00 

47 SWNYNTNI TEENVQNMNNAG 66 

3 01 GGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACA 3 60 

67 DKWSAFLKEQSTLAQMYPLQ 86 

£] 

Q 3 61 AGAAATTCAGAATCTCACAGTCAAGCTTCAGCTGCAGGCTCTTCAGCAAAATGGGTCTTC 42 0 

i_U 87 EIQNLTVKLQLQALQQNGSS 



106 



421 AGTGCTCTC AGAAGACAAGAGCAAACGGTTGAACAC AATTCTAAATAC AATGAGC ACC AT 480 

107 VLSEDKSKRLNTILNTMSTI 126 

a: 

M . . . - 

fij 481 CTACAGTACTGGAAAAGTTTGTAACCC AGATAATCC ACAAGAATGCTTATTACTTGAACC 540 

CI 127 YSTGKVCNPDNPQECLLLEP 146 

H 541 AGGTTTGAATGAAATAATGGC AAAC AGTTTAGACTAC AATGAGAGGCTCTGGGCTTGGGA 600 

147 GLNEIMANSLDYNERLWAWE 166 

...••* 

601 AAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTT 660 

167 SWRS EVGKQLRPLYEEYVVL 186 

661 GAAAAATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGA 72 0 

187 KNEMARANHYEDYGDYWRGD 206 

721 CTATGAAGTAAATGGGGTAGATGGCTATGACTACAGCCGCGGCCAGTTGATTGAAGATGT 7 80 

207 YEVNGVDGYDYSRGQL I EDV 226 

7 81 GGAACATACCTTTGAAGAGATTAAACCATTATATGAACATCTTCATGCCTATGTGAGGGC 840 

227 EHTFEEIKPLYEHLHAYVRA 246 

841 AAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGATGCCTCCCTGCTCATTT 900 

247 KLMNAYPSYI SPIGCLPAHL 266 
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Figure 37B 



9 01 GCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCTTTGACAGTTCCCTTTGG 9 6 0 

267 LGDMWGRFWTNLYSLTVPFG 286 

9 61 AC AGAAACCAAACATAGATGTTACTGATGCAATGGTGGACCAGGCCTGGGATGCACAGAG 102 0 

287 QKPN I DVTDAMVDQAWDAQR 306 

1021 AATATTCAAGGAGGCCGAGAAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGG 1080 

307 I FKEAEKFFVSVGLPNMTQG 326 



1081 ATTCTGGGAAAATTCCATGCTAACGGACCC AGGAAATGTTC AGAAAGCAGTCTGCCATCC 114 0 

327 FWENSMLTDPGNVQKAVCHP 346 

CACAGCTTGGGACCTGGGGAAGGGCGACTTC AGGATCCTTATGTGC AC AAAGGTGACAAT 12 00 

TAWDLGKGDFRI LMCTKVTM 366 



GGACGACTTCCTGACAGCTCATCATGAGATGGGGCATATCCAGTATGATATGGCATATGC 126 0 

DDFLTAHHEMGHIQYDMAYA 386 

• ♦ ♦ • . 

TGC ACAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGA 132 0 

AQPFLLRNGANEGFHEAVGE 406 



AATCATGTCACTTTCTGCAGCCACACCTAAGCATTTAAAATCCATTGGTCTTCTGTCACC 13 8 0 
IMSLSAATPKHLKSIGLLSP 426 



CGATTTTCAAGAAGACAATGAAACAGAAATAAACTTCCTGCTCAAACAAGCACTCACGAT 1440 
DFQEDNETEINFLLKQALTI 446 



1441 TGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGGTGGATGGTCTTTAAAGG 1500 
447 VGTLPFTYMLEKWRWMVFKG 466 



1501 GGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGG 156 0 
467 EI PKDQWMKKWWEMKREIVG 486 



1561 GGTGGTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGT 162 0 

487 VVEPVPHDETYCDPASLFHV 506 

1621 TTCTAATGATTACTCATTC ATTCGATATTACACAAGGACCCTTTACCAATTCCAGTTTCA 168 0 

507 SNDYSFIRYYTRTLYQFQFQ 526 



1141 
347 
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r: 
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1201 
367 
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1261 
387 



m 
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U! 
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1321 
407 



1381 
427 



16 81 AGAAGCACTTTGTCAAGCAGCTAAACATGAAGGCCCTCTGCACAAATGTGACATCTCAAA 1740 
527 EALCQAAKHEGPLHK CDISN 546 
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Figure 37C 



17 41 CTCTACAGAAGCTGGACAGAAACTGTTCAATATGCTGAGGCTTGGAAAATCAGAACCCTG 180 0 

547 STEAGQKLFNMLRLGKSEPW 566 

1801 GACCCTAGCATTGGAAAATGTTGTAGGAGCAAAGAACATGAATGTAAGGCCACTGCTCAA 1860 

567 TLALENVVGAKNMNVRPLLN 586 



18 61 CTACTTTGAGCCCTTATTTACCTGGCTGAAAGACCAGAACAAGAATTCTTTTGTGGGATG 192 0 

587 YFEPLFTWLKDQNKNSFVGW 606 

1921 GAGTACCGACTGGAGTCCATATGCAGACCAAAGCATCAAAGTGAGGATAAGCCTAAAATC 19 8 0 

607 STDWSPYADQSIKVRISLKS 626 

1981 AGCTCTTGGAGATAAAGC ATATGAATGGAACGACAATGAAATGTACCTGTTCCGATCATC 2 040 

627 ALGDKAYEWNDNEMYLFRSS 646 



2 041 TGTTGC ATATGCTATGAGGCAGTACTTTTTAAAAGTAAAAAATCAGATGATTCTTTTTGG 2100 
647 VAYAMRQYFLKVKNQMILFG 666 



2 

Q 
W 

{ £) 2101 GGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAAGAATCTCCTTTAATTTCTTTGTCAC 2160 

|H 667 EEDVRVANLKPRISFNFFVT 686 

.isit 

u\ 

j[ E : 2161 TGCACCTAAAAACGTGTCTGATATCATTCCTAGAACTGAAGTTGAAAAGGCCATCAGGAT 222 0 

687 APKNVSDI I PRTEVEKAIRM 706 

li ii 

! asf 

•■; % ...... 

tj* 2221 GTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGG 22 8 0 

CI 707 SRSRINDAFRLNDNSLEFLG 726 

N 

22 81 GATACAGCCAACACTTGGACCTCCTAACCAGCCCCCTGTTTCCATATGGCTGATTGTTTT 234 0 
727 IQPTLGPPNQPPVSIWLIVF 746 

2341 TGGAGTTGTGATGGGAGTGATAGTGGTTGGCATTGTCATCCTGATCTTCACTGGGATCAG 2400 
747 GVVMGVIVVGIVILIFTGIR 766 

2 401 AGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAGAAAATCCTTATGCCTCCATCGATAT 2 46 0 
767 DRKKKNKARSGENPYASIDI 786 



2 4 61 TAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAGACCTCCTTTTA 2 52 0 
787 SKGENNPGFQNTDDVQTSF * 806 

2 521 GAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGTTGTATGTAAATGTTAATTTCATGG 2 58 0 



2 581 TATAGAAAATATAAGATGATAAAGATATCATTAAATGTCAAAACTATGACTCTGTTCAGA 



2640 
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Figure 37D 

2 641 AAAAAAATTGTCCAAAGACAACATGGCCAAGGAGAGAGCATCTTCATTGACATTGCTTTC 2700 

27 01 AGTATTTATTTCTGTCTCTGGATTTGACTTCTGTTCTGTTTCTTAATAAGGATTTTGTAT 2 760 

2761 TAGAGTATATTAGGGAAAGTGTGTATTTGGTCTCAC AGGCTGTTC AGGGATAATCTAAAT 2 82 0 

2 821 GTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGATATATCATTGGAGC AAGTGTTG 2 880 

2 8 81 GATCTTGTATGGAATATGGATGGATCACTTGTAAGGACAGTGCCTGGGAACTGGTGTAGC 2 94 0 

2941 TGCAAGGATTGAGAATGGCATGC ATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGA 3 000 

CATGCTTTCTTCACAGTAACTCAGTTCAAGTACTATGGTGATTTGCCTACAGTGATGTTT 306 0 

GGAATCGATCATGCTTTCTTCAAGGTGACAGGTCTAAAGAGAGAAGAATCCAGGGAACAG 312 0 

GTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAACATCTCCCTGACAACACAA 318 0 

AACTAGAGCCAGGGGCCTCCGTGAACTCCCAGAGCATGCCTGATAGAAACTCATTTCTAC 3240 

TGTTCTCTAACTGTGGAGTGAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGT 3 3 0 0 

ACCCAGTCTCTTAAATCTTTTGTATTTGCTCACAGTGTTTGAGCAGTGCTGAGCACAAAG 3 3 6 0 
CAGACACTCAATAAATGCTAGATTTACACACTCAAAAAAAAAAAA 3 405 
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Figure 38A 

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTN GTGAGGNCTC CGTGCCCACC 60 
1MFSP WKI SMF LSVX EXS VPT 20 



61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 12 0 

21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGC TGG GCTGGCTCAA CACCATCCAG 180 

41TFAQ SKC PQV EWLG WLN T I Q 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AG AAC AT C T T TGTCCTCAGC 240 

61PPFL WVL FVL ATLE NIF VLS 80 



2 41 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 3 00 
81VFCL HKS SCT VAEI YLG NLA 100 



301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 3 60 
101 AADL I L A CGL PFWA ITI SNN 120 



3 61 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 42 0 
121 FDWL FGE TLC RVVN All SMN 140 



<& ...... 

Ih'l 421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 

g{ 141 LYSS ICF LML VSID RYL ALV 160 

H 481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 

jf[| 161 KTMS MGR MRG VRWA KLY SLV 180 

r 1 . 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 

CI 181 IWGC TLL LSS PMLV FRT MKE 200 

N 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 

201 YSDE GHN VTA CVIS YPS LIW 220 



661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 72 0 
221 EVFT NML LNV VGFL LPL SVI 240 



721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA AC GAG ATGC A GAAGTTCAAG 7 80 
241 TFCT M Q I MQV LRNN EMQ KFK 260 

7 81 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
261 EIQT ERR A T V LVLV VLL L F I 280 



841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ 1ST FLDT LHR L G I 300 
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Figure 38B 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GANGTAATCA CACAGATCGC CTCCTTCATG 960 
301 LSSC QDE RII DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 102 0 
321AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG NGGGC TGC AG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKX GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA AC GC C AG AT T 1140 
361 IQME NSM GTL RTSI SVE R Q I 380 

1141 CACAAACTGC AGGAC TGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q * 391 

12 01 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGC C AAGGAG 1260 

12 61 ACATCTATGC AC G AC C TTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 132 0 
1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG AT GGGGTGAA CTCACGCACA 1380 

13 81 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TT GC TGC C AC ACCTGAGCCA 1440 
1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 
1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 162 0 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT T C T TGC AT T A ATAAAGGTTA AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 
1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGC AC TGT AGGCAAGACC CAAGAAAGAG AAGGAGC CAT CTCCATCTTG AAGGAACTCA 1860 
1861 AAGACTCAAG TGGGAACGAC TGGGC ACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 192 0 
1921 CGAGCAGGGT GC TGTGGGTG ATATGGACAG CAGAAGGGGG AG AC C AAGGT TCCAGCTCAA 1980 
1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2 040 
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Figure 38C 

2 041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA AC TGGGAT AT GTTTACTATA AGGAAAAGAC 2160 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 22 2 0 
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 22 80 
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGC AT T T GGAGAGAAGG CCATGTCTTC 2 340 
2 341 AAAGTC T GAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400 
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG T AC C TGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2 52 0 
2 521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2 580 
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2 640 
2 641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAAC C TGGA 2700 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 27 60 
27 61 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGC TGGAA TCTGAAGGGC TAGAACCTGG 2 820 
2 821 AGGGC TGGAA TCTGGAGAGC TAGAACCTGG AGGGC T AG AA CCTGGAGGGC TAGAACCTAG 2 880 
2 881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA AC C TGGAGGG CTAGAACCTG 2 940 

2 941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGC TAG AACCTGGCAG GTTAGAACCT 3000 

3 001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGC T A GAAC C TGGAA GGGCT AGAAC 3 060 
3 061 CTGTAGAGCT AGAAC ATGGA GAGCTAGAAC CCGGCAGGCT AGAAC CTGGC AAGC T AGAAC 3120 
3121 CTGGAGGGAA TGAACCTGGA GGGC T AGAAC CTGGAGAATG AG AAAAATT T ACATGGCAAA 3180 
3181 GAGCCCATAA AT C C TG AC C A ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240 
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Figure 38D 

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300 
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 33 60 
33 61 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3 42 0 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3 540 
3 541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3 600 
3601 GTGAAAG AC T CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3 660 
3 661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 372 0 
3721 AAAAAAAAAA AAA 3733 
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1 CACCCTATCC T AC AC TACT A 



61 TGCCCACCCT GGCTCCCAAA 



121 GCAGCCAAAC GCCTCCTTCT 
181 AGGCCTGAAG GAACAGGCCG 



241 CCAGCGCCGG CATCTGGAGA 
1 



3 01 GTCCTCCTCT GTGCTTGTGC 
13 V L L C ACA 



361 GTGAGAAATT GTTCCACCAA 
33VRNC STN 



421 TCACTCACAG CCCTCCGCCA 
53SLTA LRQ 



481 GGCACAGATG CTCACATGAA 
73GTDA HMN 



541 ACAGGCTTTA CAGGGTCTGC 
93TGFT GSA 



601 ACCGACAGTC GCTACTGGAC 
113 T D S R Y W T 

661 AAGGAAGTTG GCACCACTCC 
133 K E V G T T P 



721 CGTGTGGGTT TTGACCCCTT 
153 R V G F D P F 



781 CTCCAAGGCT CTAACAGACA 
173 L Q G S N R Q 

841 GGATCAGAGA GGCCACCGGT 
193 G S E R P P V 



901 GGGAGCACTT GGCAGGAGAA 
213 G S T W Q E K 



Figure 41A 



GGAACTTGCA CAGTCCGCCT 



ACCCTCCAAA AC AAAAGAC C 



TGACGCCAGC CCCCACCCTC 



GGGAAGGAGC CCTCCCTCTC 



CCCTATGGCC CGGGCTCACT 
MA R A H W 



CTGGGGCCAC AC AAAGC C AC 
W G H T K P L 



CCCCCCTTAC CTTCCAGTTA 
P P Y L P V T 



GCAGATGCAG ACCCAGAATC 
Q M Q T Q N L 



CGAGTACATC GGCCAACATG 
E Y I G Q H D 



AGGAAC TGC A GTGGTGAC T A 
G T A V V T M 



TCAGGCTGAG CGGCAAATGG 
Q A E R Q M D 



T ATTGTC AC C TGGCTCCTCA 
I V T W L L T 



CCTCTTGTCC ATTGACACCT 
L L S I D T W 



GCTGGTGTCC ATCACAACCA 
L V S I T T N 



TCCAAATCAA CCCATTTATG 
P N Q P I Y A 



AGTATCTGGC GTCCGAAGCC 
V S G V R S Q 



CGGGCAGCCC AAAGCTCCTC 60 
AGAAAAGCAC TCTCCACCCA 120 



TGTCTGCTCG AGCCCAGGAA 180 



TCCCTTGTCC CTCCATCCAC 240 



GGGGCTGCTG CCCCTGGCTG 3 00 
G C C P W L 12 



TGGACCTTGG AGGGCAGGAT 3 60 
D L G G Q D 32 

CTGTGGTCAA T AC C AC AATG 42 0 
V V N T T M 52 



TCTCAGCCTA CATCATCCCA 480 
SAY IIP 72 



AC G AG AGGC G TGCGTGGATT 540 
ERR A W I 92 

TGAAGAAAGC AGCTGTCTGG 600 
K K A A V W 112 



ACTGTAATTG GGAGCTCCAT 660 
C N W E L H 132 

CCGAGATTCC TGCTGGAGGG 72 0 
E I P A G G 152 



GGGAGAGT T A TGATCTGGCC 780 
E S Y D L A 172 



ATCTTGTGGA CCTGGTATGG 840 
LVD L V W 192 



CCCTGCAGGA GGCATTCACA 900 
L Q E AFT 212 



AGATGCAGAA GC AT C AAAAG 960 
M Q K H Q K 232 
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Figure 4 IB 



9 61 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT G AGG AG AC GG CCTGGCTCTT CAACCTTCGA 102 0 
233 VPTA V L L SAL EETA WLF NLR 252 



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
253 ASDI PYN PFF Y 'SYT LLT DSS 272 



10 81 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 IRLF ANK SRF SSET LSY LNS 292 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 12 00 
293 SCTG PMC V Q I EDYS QVR DSI 312 



12 01 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGAT TGGGA CCAGCTATAC CATGTATGGG 12 60 
313 QAYS LGD VRI WIGT SYT MYG 332 



12 61 ATC T ATG AAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 1320 

Q 333 IYEM IPR EKL VTDT YSP VMM 352 

U»l . . ... 

jfj; 1321 AC C AAGGC AG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGC GGGAC 13 80 

VI 353 TKAV KNS KEQ ALLK ASH VRD 372 



V! ...... 

ffj 13 81 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440 

373 AVAV IRY L V W LEKN VPK GTV 3 92 

H ...... 

f\\ 1441 GATGAGTTTT C GGGGGC AG A G ATC GTGGAC AAGTTC CGAG GAGAAGAACA GTTCTCCTCC 1500 

Wi 393 DEFS GAE IVD KFRG EEQ FSS 412 

w ...... 

Q 1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 15 60 

Li 413 GPSF ETI SAS GLNA ALA HYS 432 



1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 162 0 
433 PTKE LNR KLS SDEM YLL DSG 452 



1621 GGGCAGTACT GGGAC GGGAC CACAGACATC AC C AGAAC AG TCCACTGGGG CACCCCCTCT 1680 
453 GQYW DGT TDI TRTV HWG TPS 472 



16 81 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 
473 AFQK EAY TRV LIGN IDL SR L 492 



1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 
493 IFPA ATS GRM VEAF ARR ALW 512 



1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 
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Figure 41C 

18 61 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920 
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980 
553 SIEP GYY KDG EFGI RLE DVA 572 

19 81 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGC T AC CTGACCTTGT GGTATCATTT 2 040 
573 L V V E AKT KYP GELP D L V VSF 592 

2 041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCCGAGCA TCTCCAGTAC 2100 
593 VPYD RNL IDV SLLS PEH LQY 612 

2101 CTGAATCGCT ACTACCAGAC CATC C GGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 
613 LNRY YQT IRE KVGP E L Q RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2 22 0 
633 L L E E FEW LQQ HTEP LAA RAP 652 

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 22 80 
653 DTAS WAS VLV VSTL AIL GWS 672 

22 81 GTCTAGAGGC TCCAGACTCT CCTGTTAACC C TC CATC TAG ATGGGGGGCT CCCTTGCTTA 2 3 40 
673 V * 673 

23 41 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 
2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 
2 461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2 52 0 
2 521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGC C AGC TAG TCTCTTCTCT TCTGTGATCT 2 580 
2 581 CAGTAGGCCT AACCTATAAC C T AAC AC AG A CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2 640 
2 641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700 
27 01 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 27 60 
27 61 CCCCCACTTT TTAACCATCG TCCATTAACC AC C TGGTGGG C AT AGC C AG A GCTGTTCGAA 2820 
2 821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2 880 
2 881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TT AAGC CAT A GGAATTTGGC TGTGGAGTAA 2940 
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Figure 4 ID 

2 941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 

3 001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3 060 
3 061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 312 0 
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3 240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3 3 00 
3 3 01 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 33 60 
33 61 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 342 0 
3421 GCCTGTGG 3428 
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1 CTGTGCATGG CATCATCCTG 
1 M A S S W 



61 TTCCCTCAAA ATGCTACGGC 
19FPQN ATA 



121 GTGCTGCCGA CATTTATCAT 
3 9V L P T FII 



181 CTGTTGGTCT TCCTCCTGCC 
59LLVF LLP 



2 41 CTGGCAGCCT CTGATCTGGT 
79LAAS DLV 



3 01 AACCAGTTTA ACTGGCCTTT 
99NQFN WPF 



3 61 GCCAATTTGT TCATCAGCAT 
119 A N L F I S I 



421 CTGGTGCACC CTATGGCCAG 
139 L V H P MAS 



481 GTGCTCATCT GGGTTGTGGG 
159 V L I W V V G 



541 CAAGCCGTCC CAGATCTGAA 
179 Q A V P D L N 



601 CACTTTGCAA GGATTGTGGA 
199 H F A R I V E 



661 GTCTTCTTCA ACTACCACAT 

219 V F F N Y H I 

721 AGAGTGCAGG GGCCGAAGGA 

239 R V Q G P K D 



781 TTCCTGGTCT GCTGGGCCCC 
259 F L V C WAP 



841 CAAGCAGTCC GAGGCTGCTT 
279 Q A V R G C F 



Figure 42A 

GCCCCCTCTA GAGCTCCAAT 
PPL E L Q S 



CTGTGACAAT GCTCCAGAAG 
C D N A P E A 



CTCCATCTGT TTCTTCGGCC 
SIC F F G L 



CCGGCGGCAA CTGAACGTGG 
R R Q L N V A 



GTTTGTCTTG GGCTTGCCCT 
F V L G L P F 



CGGAGCCCTC CTCTGCCGTG 
GAL L C R V 



CTTCCTGGTG GTGGCCATCA 
F L V V A I S 



CGGAAGGCAG CAGCGGCGGA 
G R Q Q R R R 



GGGCCTCTTG AGCATCCCCA 
G L L S I P T 



CATCACCGCC TGCATCCTGC 
I T A C I L L 



GTTAAATATT CTGGGTTTCC 
L N I L G F L 



CCTGGCCTCC CTGCGAACGC 
LAS L R T R 



TAGCAAGACC ACAGCGCTGA 
S K T TALI 



TTACCACTTC TTTGCCTTCC 
Y H F F A F L 



TTGGGAGGAC TTCATTGACC 
WED F I D L 



CCTCCAACCA GAGCCAGCTC 60 
S N Q SQL 18 



CCTGGGACCT GCTGCACAGA 12 0 
W D L L H R 38 



TCCTAGGGAA CCTTTTTGTC 180 
L G N L F V 58 



CAGAAATCTA CCTGGCCAAC 240 
E I Y LAN 78 



TCTGGGCAGA GAATATCTGG 3 00 
W A E N I W 98 



TCATCAACGG GGTCATCAAG 3 60 
I N G V I K 118 



GCCAGGACCG CTACCGCGTG 420 
Q D R Y R V 138 



GGCAGGCCCG GGTCACCTGC 480 
Q A R V T C 158 



CATTCCTGCT GC GAT C CATC 540 
F L L RSI 178 



TCCTCCCCCA TGAGGCCTGG 600 
L P H E A W 198 



TCCTACCACT GGC TGCGATC 660 
L P L A A I 218 



GGGAGGAGGT CAGCAGGACA 72 0 
E E V S R T 238 



TCCTCACGCT CGTGGTTGCC 7 80 
L T L V V A 258 



TGGAATTCTT ATTCCAGGTG 840 
E F L F Q V 278 



TGGGC C TGC A ATTGGCCAAC 900 
G L Q LAN 298 
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Figure 42B 

9 01 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 

961 TTCAGGACCA AGGTC TGGG A ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 102 0 
319 FRTK VWE LYK QCTP KSL API 338 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE I F Q LFWR N * 353 



1081 CC 1082 
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Figure 43A 

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGCCTC CGTGCCCACC 60 
1MFSP WKI SMF LSVR EAS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGC C C AC TCTTAACGGG 120 
21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGC TGG GCTGGCTCAA CACCATCCAG 180 
41TFAQ SKC PQV EWLG WLN TIQ 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61PPFL W V L F V L ATLE NIF VLS 80 

2 41 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 3 00 
81VFCL HKS SCT VAEI YLG NLA 100 

3 01 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 3 60 
H 101 AADL I L A CGL PFWA ITI SNN 120 



m 



3 61 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA AT GC C AT TAT CTCCATGAAC 420 
121 FDWL FGE T L C RVVN All SMN 140 



421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141 L Y S S ICF L M L VSID RYL ALV 160 



[Ufc 481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 

!! 3 !l 161 KTMS MGR MRG VRWA KLY SLV 180 

:: it 
'issi 

Ml 541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 

•p 181 IWGC TLL LS S PMLV FRT MKE 200 

H 

601 TACAGCGATG AGGGC C AC AA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 

201 YSDE GHN VTA CVIS YPS LIW 220 



661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 72 0 
221 EVFT NML LNV VGFL LPL SVI 240 



721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA AC G AGATGC A GAAGTTCAAG 7 80 

241 T FCT MQI MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 

261 EIQT ERR A T V LVLV VLL LFI 280 

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCC TGGAT A CGCTGCATCG CCTCGGCATC 900 

281 ICWL PFQ 1ST FLDT LHR LGI 300 
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Figure 43B 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 
301 Li S S C QDE RII DVIT Q I A SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321AYSN S C L NPL. VYVI VGK.RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA C C AGGGAGTG TGC CAGAAAG GGGGCTGCAG GTCAGAACCC 1080 
341KKSW EVY QGV CQKG GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q* 

12 01 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 12 60 

12 61 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 132 0 
1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 

13 81 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 
1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 
1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 
1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGC AC TGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 
1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 
1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 
1981 CC AATAAC T A TTGCAC AAC C ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 
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Figu: 

2 041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA 
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC 
2281 CCACCCTGAG GCCCCAACCG CCACACACAC 
2 341 AAAGTC T GAT TTGTGATGAG GCAGAGGAAG 
2401 AC AGTGC TGA GACCCCCCAC CACCAGCCGG 
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG 
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG 
2 581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT 
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG 
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG 
2 821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG 
2 881 AAGGGCTAGA AC C TGGAGGG CTGGAATCTG 

2 941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT 

3 001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC 
3 061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC 
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC 
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT 



e 43C 

TAAAGTACTT AGAAAAGCAA AGGGTGC T AC 2100 
AC TGGG AT AT GTTTACTATA AGGAAAAGAC 2160 
GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 222 0 
TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280 
AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2 340 
ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400 
TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
GGATTGTTCC TGTCAATCAA TGGTTTATTG 252 0 
TGTGAAAAGG AATGGCAATG GTGTTCACCA 2 580 
TGATAAATGA ATATTTATTA GCTGGTTGGA 2 640 
GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700 
GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760 
AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2 82 0 
AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2 880 
GAGAGCTAGA AC C TGGAGGG CTAGAACCTG 2 940 
GGAGGGC TAG AACCTGGCAG GTTAGAACCT 3000 
TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3 060 
CCGGCAGGCT AGAAC C TGGC AAGCTAGAAC 312 0 
CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180 
GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3 240 
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Figure 43D 

32 41 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 33 00 

33 01 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 33 60 
3 3 61 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 342 0 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTC TGAGGG GCAACTGAGT 3 540 
3 541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3 600 
3 601 GTGAAAG AC T CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 
3 661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 372 0 
3721 AAAAAAAAAA AAA 3733 
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Figure 44A 

1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60 

1MHLIDYLLLLLVGLLALSHG 20 

6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 12 0 

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41 ETGEGSPSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61 RFYYLIASETPGKNIFFSPL 80 

241 AGCATGTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 3 00 

81 SI SAAYAMLSLGACSHSRSQ 100 

3 01 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 3 6 0 

101 ILEGLGFNLTELSESDVHRG 120 



Ijjl 3 61 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 42 0 

121 FQHLLHTLNLPGHGLETRVG 140 



421 AGTGCTCTGTTCCTGAGCC ACAACCTGAAGTTCCTTGCAAAATTCCTGAATGAC ACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 

481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

161 AVYEAKLFHTNFYDTVGTIQ 180 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 

181 LINDHVKKETRGKIVDLVSE 200 

601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 

201 LKKDVLMVLVNYIYFKALWE 220 

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 72 0 

221 KPF I SSRTTPKDFYVDENTT 240 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 7 80 

241 VRVPMMLQDQEHHWYLHDRY 260 



7 81 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 LPCSVLRMDYKGDATVFFIL 280 
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Figure 44B 



841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 
281 PNQGKMREI EEVLTPEMLMR 300 



901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960 
301 WNNLLRKRNF YKKLELHLPK 320 



9 61 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 102 0 

321 FS I SGSYVLDQILPRLGFTD 340 

1021 CTGTTCTCC AAGTGGGCTGACTTATCCGGC ATCACCAAACAGC AAAAACTGGAGGC ATCC 10 80 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCC AC AAGGCC ACCTTGGACGTGGATGAGGCTGGC ACCGAGGCTGCAGCAGCC 114 0 

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACGACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200 

381 TTFAIKFFSAQTNRHILRFN 400 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 12 60 

401 RPFLVVIFSTSTQSVLFLGK 420 



12 61 GTCGTCGACCCCACGAAACCATAG 12 84 
421 VVDPTKP* 428 
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Figure 45A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60 
1MHLIDYLLLLLVGLLALSHG 20 



6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 12 0 
21QLHVEHDGESCSNSSHQQIL 40 



121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 18 0 
41 ETGEGSPSLKIAPANADFAF 60 



181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 
61RFYYLIASETPGKNIFFSPL 80 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 3 00 
81 S I SAAYAMLSLGACSHSRSQ 100 



3 01 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 3 6 0 
101 ILEGLGFNLTELSESDVHRG 120 



ifj 3 61 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACATGCGTGGGC 42 0 

$ 121 FQHLLHTLNLPGHGLETCVG 140 

Cfi ...... 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 48 0 

!: . 141 SALFLSHNLKFLAKFLNDTM 160 

ry 

CJ 481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

yj 161 AVYEAKLFHTNFYDTVGTIQ 180 

Q 

I- ...... 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 

181 LINDHVKKETRGKIVDLVSE 200 

601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 

201 LKKDVLMVLVNYIYFKALWE 220 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACAC AACA 72 0 
221 KPFI SSRTTPKDFYVDENTT 240 



721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 7 80 
241 VRVPMMLQDQEHHWYLHDRY 260 



7 81 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 L P C SVLRMDYKGDATVFF I L 2 80 
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Figure 45B 



841 C C T AAC C AAGGC AAAATG AGGG AG ATTG AAGAGGTT C TGAC TC C AG AG ATGC T AATGAGG 
281 PNQGKMRE I E EVLT P EMLMR 



901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 
301 WNNLLRKRNFYKKLELHLPK 



9 61 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 
321 FSISGSYVLDQILPRLGFTD 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 
341 LFSKWADLSGITKQQKLEAS 



1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 
361 KSFHKATLDVDEAGTEAAAA 



1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 
381 TTFAIKFFSAQTNRHILRFN 



12 01 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 
401 RPFLVVIFSTSTQSVLFLGK 



12 61 GTC GTC GAC C C C AC GAAAC C ATAG 12 84 
421 VVDPTKP* 428 
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1 CACCCTATCC TACACTACTA 



61 TGCCCACCCT GGCTCCCAAA 



121 GCAGCCAAAC GCCTCCTTCT 



181 AGGCCTGAAG GAACAGGCCG 



241 CCAGCGCCGG CAT C TGGAGA 
1 

3 01 GTCCTCCTCT GTGCTTGTGC 
13VLLC ACA 



361 GTGAGAAATT GTTCCACCAA 
33VRNC STN 



421 TCACTCACAG CCCTCCGCCA 
53SLTA LRQ 

481 GGCACAGATG CTCACATGAA 
73GTDA HMN 



541 ACAGGCTTTA CAGGGTCTGC 
93TGFT GSA 



601 ACCGACAGTC GCTACTGGAC 
113 T D S R Y W T 



6 61 AAGGAAGTTG GCACCACTCC 
133 K E V G T T P 



721 CGTGTGGGTT TTGACCCCTT 
153 R V G F D P F 



781 CTCCAAGGCT C T AAC AG AC A 
173 L Q G S N R Q 



841 GGATCAGAGA GGCCACCGGT 
193 G S E R P P V 



901 GGGAGCACTT GGCAGGAGAA 
213 G S T W Q E K 



Figure 4 6A 



GGAACTTGCA CAGTCCGCCT 
ACCCTCCAAA ACAAAAGACC 



TGACGCCAGC CCCCACCCTC 



GGGAAGGAGC CCTCCCTCTC 



CCCTATGGCC CGGGCTCACT 

MA R A H W 

C TGGGGC C AC AC AAAGC C AC 

W G H T K P L 



CCCCCCTTAC CTTCCAGTTA 
P P Y L P V T 



GCAGATGCAG ACCCAGAATC 
Q M Q T Q N L 



CGAGTACATC GGCCAACATG 
E Y I G Q H D 



AGGAAC TGC A GTGGTGACTA 
G T A V V T M 



TCAGGCTGAG CGGCAAATGG 
Q A E R Q M D 



TATTGTCACC TGGCTCCTCA 
I V T W L L T 



CCTCTTGTCC AT TGAC AC C T 
L L S I D T W 



GCTGGTGTCC ATCACAACCA 
L V S I T T N 



TCCAAATCAA CCCATTTATG 
P N Q P I Y A 



AGTATCTGGC GTCCGAAGCC 
V S G V R S Q 



CGGGCAGCCC AAAGC TCCTC 60 
AGAAAAGCAC TCTCCACCCA 12 0 



TGTCTGCTCG AGCCCAGGAA 180 



TCCCTTGTCC CTCCATCCAC 240 



GGGGCTGCTG CCCCTGGCTG 300 
G C C P W L 12 



TGGACCTTGG AGGGCAGGAT 3 60 
D L G G Q D 32 



CTGTGGTCAA TACCACAATG 42 0 
V V N T T M 52 



TCTCAGCCTA CATCATCCCA 480 
SAY IIP 72 



AC G AG AGGC G TGCGTGGATT 540 
ERR A W I 92 



TGAAGAAAGC AGCTGTCTGG 600 
K K A A V W 112 



ACTGTAATTG GGAGCTCCAT 660 

C N W E L H 132 

CCGAGATTCC NGC TGGAGGG 72 0 

E I P A G G 152 



GGGAGAGTTA TGATCTGGCC 780 
E S Y D L A 172 



ATCTTGTGGA CCTGGTATGG 840 
LVD L V W 192 



CCCTGCAGGA GGCATTCACA 90 0 
L Q E AFT 212 



AGATGCAGAA GCATCAAAAG 960 
M Q K H Q K 232 
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961 GTCCCGACTG 
233 V P T A 



1021 GCCAGTGACA 
253 A S D I 



10 81 ATTAGGTTGT 
273 I R L F 



1141 AGTTGCACAG 
293 S C T G 



12 01 CAGGCCTACT 
313 Q A Y S 



H 12 61 AT C T AT G AAA 

Q 333 I Y E M 



W 13 21 ACCAAGGCAG 

Ul 353 T K A V 

i! r f • 

%l 13 81 GCTGTGGCTG 
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Figure 46B 



CCGTCCTTCT GTCGGCGCTT G AGG AG AC GG 
V Li L SAL EETA 



TCCCCTATAA CCCCTTCTTC TATTCCTACA 
PYN PFF YSYT 



TTGCAAACAA GAGTCGCTTT AGCTCCGAAA 
ANK SRF SSET 



GCCCCATGTG TGTGCAAATC GAGGATTACA 
PMC VQI EDYS 



C AT TGGGAG A TGTGAGGATC TGGATTGGGA 
LGD VRI WIGT 



TGATACCAAG GGAGAAACTC GTGACAGACA 
IPR EKL VTDT 



TGAAGAACAG CAAGGAGCAG GCCCTCCTCA 
KNS KEQ ALLK 



TGATCCGGTA CTTGGTCTGG CTGGAGAAGA 
IRY LVW LEKN 



CGGGGGCAGA GATCGTGGAC AAGTTCCGAG 
GAE IVD KFRG 



TTGAAACCAT CTCTGCTAGT GGTTTGAATG 
ETI SAS GLNA 



AGCTGAACCG CAAGCTGTCC TCAGATGAGA 
LNR KLS SDEM 



GGGAC GGGAC C AC AG AC AT C ACCAGAACAG 
DGT TDI TRTV 



AGGAGGCATA TACCCGTGTG CTGATAGGAA 
EAY TRV LIGN 



CTGCTACATC AGGGCGAATG GTGGAGGCCT 
ATS GRM VEAF 




CCTGGCTCTT CAACCTTCGA 102 0 
W L F N L R 252 



CGCTGCTCAC AGACTCTTCT 1080 
L L T D S S 272 



CCTTGAGCTA TCTGAACTCC 1140 
L S Y L N S 292 



GCCAAGTTCG TGACAGCATC 12 00 
Q V R D S I 312 



CCAGCTATAC CATGTATGGG 12 60 
S Y T M Y G 332 



CCTACTCCCC AGTGATGATG 1320 
Y S P VMM 352 



AGGCCAGCCA C GTGC GGGAC 13 80 
ASH V R D 372 



ACGTGCCCAA AGGCACAGTG 1440 
V P K G T V 392 



GAGAAGAACA GTTCTCCTCC 1500 
E E Q F S S 412 



CTGCCCTGGC CCACTACAGC 1560 
ALA H Y S 432 



TGTACCTGCT GGACTCTGGG 162 0 
Y L L D S G 452 



TCCACTGGGG CACCCCCTCT 1680 
H W G TPS 472 



AT ATTGAC CT GTCCAGGCTC 1740 
I D L S R L 492 



TTGCCCGCAG AGCCTTGTGG 1800 
A R R A L W 512 



1801 GATGCTGGTC TCAATTATGG TC ATGGG AC A GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 



D0053 NP 



Figure 46C 

1861 CATGAGTGGC C AGTGGG AT T CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 192 0 
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAAT TTGGGA TCCGTCTCGA AG ATGTGGC T 1980 
553 SIEP GYY KDG EFGI RLE DVA 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGC T AC CTGACCTTGT GGTATCATTT 2 040 
573 L V V E AKT KYP GELP DLV VSF 592 

2041 GTGCCCTATG AC CGGAAC C T CATCGATGTC AGCCTGCTGT CTCCNGAGCA TCTCCAGTAC 2100 
593 VPYD RNL IDV SLLS PEH LQY 612 

2101 CTGAATCGCT AC T AC C AGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGC C AG 2160 
613 LNRY YQT IRE KVGP ELQ RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 222 0 
633 LLEE FEW LQQ HTEP LAA RAP 652 

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGC TGGAGT 22 80 
653 DTAS WAS VLV VSTL AIL GWS 672 

22 81 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2 340 
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GC TGGC C C AT TGCCTAGAAA 2400 

2 401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 

2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520 

2 521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGC C AGC TAG TCTCTTCTCT TCTGTGATCT 2580 

2 5 81 CAGTAGGCCT AACCTATAAC C T AAC AC AG A CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640 

2 641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 27 00 

27 01 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 27 60 

27 61 CCCCCACTTT TT AAC CATC G TCCATTAACC ACCTGGTGGG C AT AGC C AG A GCTGTTCGAA 2820 

2 821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880 

2 8 81 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2 940 
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Figure 46D 

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 
3001 GGCAGCC GGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3 060 
3 061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 312 0 
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300 
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360 
p 3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420 
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P 3421 GCCTGTGG 342 8 
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